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PREFACE 


The last two decades have witnessed an upsurge of interest and activity 
in graph theory, particularly among applied mathematicians and engineers. 
Clear evidence of this is to be found in an unprecedented growth in the 
number of papers and books being published in the field. In 1957 there was 
exactly one book on the subject (namely, Konig’s Theorie der Endlichen und 
Unendlichen Graphen). Now, sixteen years later, there are over two dozen 
textbooks on graph theory, and almost an equal number of proceedings of 
various seminars and conferences. 

Each book has its own strength and points of emphasis, depending on 
the axe (or the pen) the author has to grind. I have emphasized the com¬ 
putational and algorithmic aspects of graphs. This emphasis arises from the 
experience and conviction that whenever graph theory is applied to solving 
any practical problem (be it in electrical network analysis, in circuit layout, 
in data structures, in operations research, or in social sciences), it almost 
always leads to large graphs—graphs that are virtually impossible to analyze 
without the aid of the computer. An engineer often finds that those real-life 
problems that can be modeled into graphs small enough to be worked on by 
hand are also small enough to be solved by means other than graph theory. 
(In this respect graph theory is different from college algebra, elementary 
calculus, or complex variables.) In fact, the high-speed digital computer is 
one of the reasons for the recent growth of interest in graph theory. 

Convinced that a student of applied graph theory must learn to enlist 
the help of a digital computer for handling large graphs, I have emphasized 
algorithms and their efficiencies. In proving theorems, constructive proofs 
have been given preference over nonconstructive existence proofs. Chapter 
11, the largest in the book, is devoted entirely to computational aspects of 
graph theory, including graph-theoretic algorithms and samples of several 
tested computer programs for solving problems on graphs. I believe this 
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PREFACE 


approach has not been used in any of the earlier books on graph theory. 
The material covered in Chapter 11 and in many sections from other chapters 
is appearing for the first time in any textbook. 

Yet the applied and algorithmic aspect of this book has not been allowed 
to spoil the rigor and mathematical elegance of graph theory. Indeed, the 
book contains enough material for a course in “pure” graph theory. The 
book has been made as much self-contained as was possible. 

The level of presentation is appropriate for advanced undergraduate and 
first-year graduate students in all disciplines requiring graph theory. The 
book is organized so that the first half (Chapters 1 through 9) serves as 
essential and introductory material on graph theory. This portion requires 
no special background, except some elementary concepts from set theory 
and matrix algebra and, of course, a certain amount of mathematical matu¬ 
rity. Although the illustrations of applications are interwoven with the theory 
even in this portion, the examples selected are short and mostly of the nature 
of puzzles and games. This is done so that a student in almost any field can 
read and grasp the first half. 

The second half of the book is more advanced, and different chapters 
require different backgrounds as they deal with applications to nontrivial, 
real-world, complex problems in different fields. Keeping this in mind, 
Chapters 10 through 15 have been made independent of each other. One 
could study a later chapter without going through the earlier ones, provided 
the first nine chapters have been covered. 

Since there is more material here than what can be covered in a one- 
semester course, it is suggested that the contents be tailored to suit the 
requirements of the students in different disciplines, for example: 

1. Electrical Engineering: Chapters 1-9, and 11, 12, and 13. 

2. Computer Science: Chapters 1-9, 11, 12, and parts of 10 and 15. 

3. Operations Research: Chapters 1-9, and 11, 14, and parts of 15. 

4. Applied Mathematics: Chapters 1-11 and parts of 15. 

5. Introductory “pure” graph theory: Chapters 1-10. 

In fact, the book grew out of a number of such courses and lecture-series 
given by the author at the Jet Propulsion Laboratory, California State 
University at Los Angeles, the Indian Institute of Technology at Kanpur, 
and the University of Illinois at Urbana-Champaign. 
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INTRODUCTION 


1-1. WHAT IS A GRAPH? 

A linear t graph (or simply a graph) G — (V, E) consists of a set of objects 
V = {v x , v 2 , . . .) called vertices, and another set E — [e x , e 2 ,. . .}, whose 
elements are called edges, such that each edge e k is identified with an unorder¬ 
ed pair (v i , Vj) of vertices. The vertices v t , v } associated with edge e k are called 
the end vertices of e k . The most common representation of a graph is by means 
of a diagram, in which the vertices are represented as points and each edge 
as a line segment joining its end vertices. Often this diagram itself is referred to 
as the graph. The object shown in Fig. 1-1, for instance, is a graph. 

Observe that this definition permits an edge to be associated with a 
vertex pair (v n vf Such an edge having the same vertex as both its end ver¬ 
tices is called a self-loop (or simply a loop. The word loop, however, has a 
different meaning in electrical network theory; we shall therefore use the term 
self-loop to avoid confusion). Edge e x in Fig. 1-1 is a self-loop. Also note that 



Fig. 1-1 Graph with five vertices and 
seven edges. 


fThe adjective “linear” is dropped as redundant in our discussions, because by a graph 
we always mean a linear graph. There is no such thing as a nonlinear graph. 
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the definition allows more than one edge associated with a given pair of 
vertices, for example, edges e 4 and e 5 in Fig. 1-1. Such edges are referred to as 
parallel edges. 

A graph that has neither self-loops nor parallel edges is called a simple 
graph. In some graph-theory literature, a graph is defined to be only a simple 
graph, but in most engineering applications it is necessary that parallel edges 
and self-loops be allowed; this is why our definition includes graphs with self¬ 
loops and/or parallel edges. Some authors use the term general graph to 
emphasize that parallel edges and self-loops are allowed. 

It should also be noted that, in drawing a graph, it is immaterial whether 
the lines are drawn straight or curved, long or short: what is important is the 
incidence between the edges and vertices. For example, the two graphs drawn 
in Figs. l-2(a) and (b) are the same, because incidence between edges and 
vertices is the same in both cases. 



(a) (b) 

Fig. 1-2 Same graph drawn differently. 

In a diagram of a graph, sometimes two edges may seem to intersect at 
a point that does not represent a vertex, for example, edges e and /in Fig. 
1-3. Such edges should be thought of as being in different planes and thus 
having no common point. (Some authors break one of the two edges at such 
a crossing to emphasize this fact.) 


a 



c 


Fig. 1-3 Edges e and / have no common 
point. 
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A graph is also called a linear complex, a 1-complex , or a one-dimensional 
complex. A vertex is also referred to as a node, a junction, a point, 0-cell, or an 
0-simplex. Other terms used for an edge are a branch, a /me, an element, a 
1-cell, an m-c, and a 1-simplex. In this book we shall generally use the terms 
graph, vertex, and edge. 

1-2. APPLICATIONS OF GRAPHS 

Because of its inherent simplicity, graph theory has a very wide range of 
applications in engineering, in physical, social, and biological sciences, in 
linguistics, and in numerous other areas. A graph can be used to represent 
almost any physical situation involving discrete objects and a relationship 
among them. The following are four examples from among hundreds of such 
applications. 

Konigsberg Bridge Problem: The Konigsberg bridge problem is perhaps 
the best-known example in graph theory. It was a long-standing problem 
until solved by Leonhard Euler (1707-1783) in 1736, by means of a graph. 
Euler wrote the first paper ever in graph theory and thus became the origina¬ 
tor of the theory of graphs as well as of the rest of topology. The problem is 
depicted in Fig. 1-4. 

Two islands, C and D, formed by the Pregel River in Konigsberg (then 
the capital of East Prussia but now renamed Kaliningrad and in West Soviet 
Russia) were connected to each other and to the banks A and B with seven 
bridges, as shown in Fig. 1-4. The problem was to start at any of the four land 
areas of the city. A, B, C, or D, walk over each of the seven bridges exactly 
once, and return to the starting point (without swimming across the river, 
of course). 

Euler represented this situation by means of a graph, as shown in Fig. 
1-5. The vertices represent the land areas and the edges represent the bridges. 

As we shall see in Chapter 2, Euler proved that a solution for this problem 
does not exist. 



B 


Fig. 1-4 Konigsberg bridge problem. 
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B 


Fig. 1-5 Graph of Konigsberg bridge 
problem. 


The Konigsberg bridge problem is the same as the problem of drawing 
figures without lifting the pen from the paper and without retracing a line 
(Problems 2-1 and 2-2). We all have been confronted with such problems at 
one time or another. 

Utilities Problem: There are three houses (Fig. 1-6) H { , H 2 , and H 3 , each 
to be connected to each of the three utilities—water (W), gas (G), and elec- 



Fig. 1-6 Three-utilities problem. 

tricity (E)—by means of conduits. Is it possible to make such connections 
without any crossovers of the conduits? 

Figure 1-7 shows how this problem can be represented by a graph—the 
conduits are shown as edges while the houses and utility supply centers are 
vertices. As we shall see in Chapter 5, the graph in Fig. 1-7 cannot be drawn 
in the plane without edges crossing over. Thus the answer to the problem is 
no. 

Electrical Network Problems: Properties (such as transfer function and 
input impedance) of an electrical network are functions of only two factors: 

1. The nature and value of the elements forming the network, such as 
resistors, inductors, transistors, and so forth. 

2. The way these elements are connected together, that is, the topology 
of the network. 
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Fig. 1-7 Graph of three-utilities problem. 


Since there are only a few different types of electrical elements, the varia¬ 
tions in networks are chiefly due to the variations in topology. Thus electrical 
network analysis and synthesis are mainly the study of network topology. In 
the topological study of electrical networks, factor 2 is separated from 1 and 
is studied independently. The advantage of this approach will be clearer in 
Chapter 13, a chapter devoted solely to applying graph theory to electrical 
networks. 

The topology of a network is studied by means of its graph. In drawing 
a graph of an electrical network the junctions are represented by vertices, and 
branches (which consist of electrical elements) are represented by edges, 
regardless of the nature and size of the electrical elements. An electrical net¬ 
work and its graph are shown in Fig. 1-8. 


b b 



Fig. 1-8 Electrical network and its graph. 
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Seating Problem: Nine members of a new club meet each day for lunch 
at a round table. They decide to sit such that every member has different 
neighbors at each lunch. How many days can this arrangement last? 

This situation can be represented by a graph with nine vertices such that 
each vertex represents a member, and an edge joining two vertices represents 
the relationship of sitting next to each other. Figure 1-9 shows two possible 


9 



Fig. 1-9 Arrangements at a dinner 
table. 


seating arrangements—these are 123456789 1 (solid lines), and 13 5 2 7 4 
9 6 8 1 (dashed lines). It can be shown by graph-theoretic considerations that 
there are only two more arrangements possible. They are 1573928461 
and 1795836241. In general it can be shown that for n people the number 
of such possible arrangements is 


and 



if n is odd, 



if n is even. 


The reader has probably noticed that three of the four examples of ap¬ 
plications above are puzzles and not engineering problems. This was done 
to avoid introducing at this stage background material not pertinent to graph 
theory. More substantive applications will follow, particularly in the last 
four chapters. 


1-3. FINITE AND INFINITE GRAPHS 

Although in our definition of a graph neither the vertex set V nor the edge 
set E need be finite, in most of the theory and almost all applications these 
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sets are finite. A graph with a finite number of vertices as well as a finite num¬ 
ber of edges is called a finite graph; otherwise, it is an infinite graph. The 
graphs in Figs. 1-1, 1-2, 1-5, 1-7, and 1-8 are all examples of finite graphs. 
Portions of two infinite graphs are shown in Fig. 1-10. 



Fig. 1-10 Portions of two infinite graphs. 

In this book we shall confine ourselves to the study of finite graphs, and 
unless otherwise stated the term “graph” will always mean a finite graph. 

1-4. INCIDENCE AND DEGREE 

When a vertex v i is an end vertex of some edge e } , v. and e } are said to be 
incident with {on or to) each other. In Fig. 1-1, for example, edges e 2 , e 6 , and 
e n are incident with vertex v 4 . Two nonparallel edges are said to be adjacent 
if they are incident on a common vertex. For example, e 2 and e 7 in Fig. 1-1 
are adjacent. Similarly, two vertices are said to be adjacent if they are the end 
vertices of the same edge. In Fig. 1-1, v 4 and v 5 are adjacent, but v y and v 4 are 
not. 

The number of edges incident on a vertex v n with self-loops counted twice, 
is called the degree , d(vf, of vertex v r In Fig. 1-1, for example, d(v { ) = 



Fig. 1-1 A graph with five vertices and 
seven edges. 
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d(v 3 ) = d(v 4 ) — 3, d(v 2 ) = 4, and d(v 5 ) = 1. The degree of a vertex is some¬ 
times also referred to as its valency. 

Let us now consider a graph G with e edges and n vertices v { , v 2 ,, . ., v n . 
Since each edge contributes two degrees, the sum of the degrees of all vertices 
in G is twice the number of edges in G. That is, 

£d(v t ) = 2e. (1-1) 

/= i 

Taking Fig. 1-1 as an example, once more, 

d(v ,) + d(v 2 ) + d(v 3 ) + d(v A ) + d(v 5 ) 

= 3 + 4 + 3 + 3+1 = 14== twice the number of edges. 

From Eq. (1-1) we shall derive the following interesting result. 

Theorem 1-1 

The number of vertices of odd degree in a graph is always even. 

Proof: If we consider the vertices with odd and even degrees separately, the 
quantity in the left side of Eq. (1-1) can be expressed as the sum of two sums, each 
taken over vertices of even and odd degrees, respectively, as follows: 

2 d( v i) = 2 d(yj) + 2 d(v k ). ( 1 - 2 ) 

i = 1 even odd 

Since the left-hand side in Eq. (1-2) is even, and the first expression on the 
right-hand side is even (being a sum of even numbers), the second expression must 
also be even: 

2 d(v k ) = an even number. (1-3) 

odd 

Because in Eq. (1-3) each d(v k ) is odd, the total number of terms in the sum must 
be even to make the sum an even number. Hence the theorem. ■ 

A graph in which all vertices are of equal degree is called a regular graph 
(or simply a regular ). The graph of three utilities shown in Fig. 1-7 is a regular 
of degree three. 

1-5. ISOLATED VERTEX. PENDANT VERTEX. 

AND NULL GRAPH 

A vertex having no incident edge is called an isolated vertex. In other 
words, isolated vertices are vertices with zero degree. Vertices v x and v 7 in 
Fig. 1-11, for example, are isolated vertices. A vertex of degree one is called 
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Fig. 1-11 Graph containing isolated vertices, series edges, and a 
pendant vertex. 

a pendant vertex or an end vertex. Vertex in Fig. 1-11 is a pendant vertex. 
Two adjacent edges are said to be in series if their common vertex is of degree 
two. In Fig. 1-11, the two edges incident on v x are in series. 

In the definition of a graph G = (V, E ), it is possible for the edge set E to 
be empty. Such a graph, without any edges, is called a null graph. In other 
words, every vertex in a null graph is an isolated vertex. A null graph of six 
vertices is shown in Fig. 1-12. Although the edge set E may be empty, the 


• v i 


9 v 2 


«U 4 9V S 


• V 6 

Fig. 1-12 Null graph of six vertices. 


vertex set V must not be empty; otherwise, there is no graph. In other words, 
by definition, a graph must have at least one vertex.t 

tSome authors (see, for example, [2-9], p. 1, or [15-58], p. 17) do allow the case in 
which the vertex set V is also empty. This they call the null graph, and they call a graph 
with E = 0 and f^0a vertex graph. For our purposes this distinction is of no con¬ 
sequence. For a lively discussion on paradoxes arising out of different definitions of the 
null graph, see pp. 40-41 in Theory of Graphs: a Basis for Network Theory, by L. M. 
Maxwell and M. B. Reed (Pergamon Press, N. Y. 1971). 
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1-6. A BRIEF HISTORY OF GRAPH THEORY 

As mentioned before, graph theory was born in 1736 with Euler’s paper 
in which he solved the Konigsberg bridge problem [l-4].t For the next 100 
years nothing more was done in the field. 

In 1847, G. R. Kirchhoff (1824-1887) developed the theory of trees for 
their applications in electrical networks [1-6]. Ten years later, A. Cayley 
(1821-1895) discovered trees while he was trying to enumerate the isomers of 
saturated hydrocarbons C„H 2n+2 [1-3]. 

About the time of Kirchhoff and Cayley, two other milestones in graph 
theory were laid. One was the four-color conjecture , which states that four 
colors are sufficient for coloring any atlas (a map on a plane) such that the 
countries with common boundaries have different colors. 

It is believed that A. F. Mobius (1790-1868) first presented the four-color 
problem in one of his lectures in 1840. About 10 years later, A. De Morgan 
(1806-1871) discussed this problem with his fellow mathematicians in Lon¬ 
don. De Morgan’s letter is the first authenticated reference to the four-color 
problem. The problem became well known after Cayley published it in 1879 
in the first volume of the Proceedings of the Royal Geographic Society. To this 
day, the four-color conjecture is by far the most famous unsolved problem in 
graph theory; it has stimulated an enormous amount of research in the field 
[ 1 - 11 ]. 

The other milestone is due to Sir W. R. Hamilton (1805-1865). In the year 
1859 he invented a puzzle and sold it for 25 guineas to a game manufacturer 
in Dublin. The puzzle consisted of a wooden, regular dodecahedron (a poly¬ 
hedron with 12 faces and 20 corners, each face being a regular pentagon and 
three edges meeting at each corner; see Fig. 2-21). The corners were marked 
with the names of 20 important cities: London, New York, Delhi, Paris, and 
so on. The object in the puzzle was to find a route along the edges of the 
dodecahedron, passing through each of the 20 cities exactly once [1-12]. 

Although the solution of this specific problem is easy to obtain (as we 
shall see in Chapter 2), to date no one has found a necessary and sufficient 
condition for the existence of such a route (called Hamiltonian circuit) in an 
arbitrary graph. 

This fertile period was followed by half a century of relative inactivity. 
Then a resurgence of interest in graphs started during the 1920s. One of the 
pioneers in this period was D. Konig. He organized the work of other mathe¬ 
maticians and his own and wrote the first book on the subject, which was 
published in 1936 [1-7]. 

The past 30 years has been a period of intense activity in graph theory— 
both pure and applied. A great deal of research has been done and is being 

t Bracketed numbers refer to references at the end of chapters. 
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done in this area. Thousands of papers have been published and more than 
a dozen books written during the past decade. Among the current leaders 
in the field are Claude Berge, Oystein Ore (recently deceased), Paul Erdos, 
William Tutte, and Frank Harary. 


SUMMARY 

In this chapter some basic concepts of graph theory have been introduced, 
and some elementary results have been obtained. An attempt has also been 
made to show that graphs can be used to represent almost any problem 
involving discrete arrangements of objects, where concern is not with the 
internal properties of these objects but with the relationships among them. 
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PROBLEMS 

1-1. Draw all simple graphs of one, two, three, and four vertices. 

1-2. Draw graphs representing problems of (a) two houses and three utilities; (b) four 

houses and four utilities, say, water, gas, electricity, and telephone. 

1-3. Name 10 situations (games, activities, real-life problems, etc.) that can be repre¬ 
sented by means of graphs. Explain what the vertices and the edges denote. 

1-4. Draw the graph of the Wheatstone bridge circuit. 

1-5. Draw graphs of the following chemical compounds: (a) CH 4 , (b) C 2 H 6 , (c) CeHe, 
(d) N 2 O 3 . {Hint: Represent atoms by vertices and chemical bonds between them by 
edges.) 

1-6. Draw a graph with 64 vertices representing the squares of a chessboard. Join these 
vertices appropriately by edges, each representing a move of the knight. You will 
see that in this graph every vertex is of degree two, three, four, six, or eight. How 
many vertices are of each type? 

1-7. Given a maze as shown in Fig. 1-13, represent this maze by means of a graph such 
that a vertex denotes either a corridor or a dead end (as numbered). An edge repre¬ 
sents a possible path between two vertices. (This is one of numerous mazes that 
were drawn or built by the Hindus, Greeks, Romans, Vikings, Arabs, etc.) 



Fig. 1-13 A maze. 
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1 - 8 . Decanting problem. You are given three vessels A, B, and C of capacities 8, 5, 
and 3 gallons, respectively. A is filled, while B and C are empty. Divide the liquid 
in A into two equal quantities. [Hint: Let a, b, and c be the amounts of liquid in 
A, B, and C, respectively. We have a + b + c = 8 at all times. Since at least one 
of the vessels is always empty or full, at least one of the following equations must 
always be satisfied: a = 0, a = 8; b = 0, b = 5; c = 0, c — 3. You will find that 
with these constraints there are 16 possible states (situations) in this process. 
Represent this problem by means of a 16-vertex graph. Each vertex stands for a 
state and each edge for a permissible change of states between its two end vertices. 
Now when you look at this graph it will be clear to you how to go from state (8, 0, 0) 
to state (4, 4, 0).] This is the classical decanting problem. 

1 - 9 . Convince yourself that an infinite graph with a finite number of edges (i.e., a graph 
with a finite number of edges and an infinite number of vertices) must have an 
infinite number of isolated vertices. 

1 - 10 . Show that an infinite graph with a finite number of vertices (i.e., a graph with a 
finite number of vertices and an infinite number of edges) will have at least one 
pair of vertices (or one vertex in case of parallel self-loops) joined by an infinite 
number of parallel edges. 

1 - 11 . Convince yourself that the maximum degree of any vertex in a simple graph with 
n vertices is n — 1. 

1 - 12 . Show that the maximum number of edges in a simple graph with n vertices is 
n(n — l)/2. 



PATHS AND CIRCUITS 


This chapter serves two purposes. The first is to introduce additional 
concepts and terms in graph theory. These concepts, such as paths, circuits, 
and Euler graphs, deal mainly with the nature of connectivity in graphs. The 
degree of vertices, which is a local property of each vertex, will be shown to 
be related to the more global properties of the graph. 

The second purpose is to illustrate with examples how to solve actual 
problems using graph theory. The celebrated Konigsberg bridge problem, 
which was introduced in Chapter 1, will be solved. The solution of the seating 
arrangement problem, also introduced in Chapter 1, will be obtained by 
means of Hamiltonian circuits. A third problem, which involves stacking 
four multicolored cubes, will also be solved. These three unrelated problems 
will demonstrate the problem-solving power of graph theory. The reader may 
attempt to solve these problems without using graphs; the difficulty of such 
an approach will quickly convince him of the value of graph theory. 

2-1. ISOMORPHISM 

In geometry two figures are thought of as equivalent (and called con¬ 
gruent) if they have identical behavior in terms of geometric properties. 
Likewise, two graphs are thought of as equivalent (and called isomorphic) if 
they have identical behavior in terms of graph-theoretic properties. More 
precisely: Two graphs G and G' are said to be isomorphic (to each other) if 
there is a one-to-one correspondence between their vertices and between 
their edges such that the incidence relationship is preserved. In other words, 
suppose that edge e is incident on vertices v x and v 2 in G; then the correspond¬ 
ing edge e' in G' must be incident on the vertices v\ and v' 2 that correspond to 
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Fig. 2-1 Isomorphic graphs. 


v x and v 2 , respectively. For example, one can verify that the two graphs in 
Fig. 2-1 are isomorphic. The correspondence between the two graphs is as 
follows: The vertices a, b, c, d, and e correspond to v 2 , v 3 , v 4 , and v 5 , 
respectively. The edges 1, 2, 3, 4, 5, and 6 correspond to e t , e 2 , e 3 , e 4 , e 5 , and 
e 6 , respectively. 

Except for the labels (i.e., names) of their vertices and edges, isomorphic 
graphs are the same graph, perhaps drawn differently. As indicated in Chap¬ 
ter I, a given graph can be drawn in many different ways. For example, Fig. 
2-2 shows two different ways of drawing the same graph. 



Fig. 2-2 Isomorphic graphs. 

It is not always an easy task to determine whether or not two given graphs 
are isomorphic. For instance, the three graphs shown in Fig. 2-3 are all 
isomorphic, but just by looking at them you cannot tell that. It is left as an 
exercise for the reader to show, by redrawing and labeling the vertices and 
edges, that the three graphs in Fig. 2-3 are isomorphic (see Problem 2-3). 

It is immediately apparent by the definition of isomorphism that two 
isomorphic graphs must have 

1. The same number of vertices. 

2. The same number of edges. 

3. An equal number of vertices with a given degree. 
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Fig. 2-4 Two graphs that are not isomorphic. 

However, these conditions are by no means sufficient. For instance, the two 
graphs shown in Fig. 2-4 satisfy all three conditions, yet they are not isomor¬ 
phic. That the graphs in Figs. 2-4(a) and (b) are not isomorphic can be shown 
as follows: If the graph in Fig. 2-4(a) were to be isomorphic to the one in (b), 
vertex jt must correspond to y, because there are no other vertices of degree 
three. Now in (b) there is only one pendant vertex, w, adjacent to y, while in 
(a) there are two pendant vertices, u and v, adjacent to jc. 

Finding a simple and efficient criterion for detection of isomorphism is 
still actively pursued and is an important unsolved problem in graph theory. 
In Chapter 11 we shall discuss various proposed algorithms and their pro¬ 
grams for automatic detection of isomorphism by means of a computer. 
For now, we move to a different topic. 


2-2. SUBGRAPHS 

A graph g is said to be a subgraph of a graph G if all the vertices and all 
the edges of g are in (7, and each edge of g has the same end vertices in g as 
in G. For instance, the graph in Fig. 2-5(b) is a subgraph of the one in Fig. 
2-5(a). (Obviously, when considering a subgraph, the original graph must 
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1 



(b) 


Fig. 2-5 Graph (a) and one of its subgraphs (b). 

not be altered by identifying two distinct vertices, or by adding new edges or 
vertices.) The concept of subgraph is akin to the concept of subset in set 
theory. A subgraph can be thought of as being contained in (or a part of) 
another graph. The symbol from set theory, g a G, is used in stating “g is 
a subgraph of G 

The following observations can be made immediately: 

1. Every graph is its own subgraph. 

2. A subgraph of a subgraph of G is a subgraph of G. 

3. A single vertex in a graph G is a subgraph of G. 

4. A single edge in G, together with its end vertices, is also a subgraph of 
G. 

Edge-Disjoint Subgraphs: Two (or more) subgraphs g, and g 2 of a graph G 
are said to be edge disjoint if gj and g 2 do not have any edges in common. 
For example, the two graphs in Figs. 2-7(a) and (b) are edge-disjoint sub¬ 
graphs of the graph in Fig. 2-6. Note that although edge-disjoint graphs do 
not have any edge in common, they may have vertices in common. Sub¬ 
graphs that do not even have vertices in common are said to be vertex dis¬ 
joint. (Obviously, graphs that have no vertices in common cannot possibly 
have edges in common.) 

2-3. A PUZZLE WITH MULTICOLORED CUBES 

Now we shall take a brief pause to illustrate, with an example, how a 
problem can be solved by using graphs. Two steps are involved here: First, 
the physical problem is converted into a problem of graph theory. Second, 
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the graph-theory problem is then solved. Let us consider the following prob¬ 
lem, a well-known puzzle available in toy stores (under the name Instant 
Insanity ). 

Problem: We are given four cubes. The six faces of every cube are various¬ 
ly colored blue, green, red, or white. Is it possible to stack the cubes one on 
top of another to form a column such that no color appears twice on any of 
the four sides of this column? (Clearly, a trial-and-error method is unsatis¬ 
factory, because we may have to try all 41,472 (= 3 x 24 x 24 x 24) poss¬ 
ibilities.) 

Solution: Step 1: Draw a graph with four vertices B, G, R, and W —one 
for each color (Fig. 2-6). Pick a cube and call it cube 1; then represent its 
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Fig. 2-6 Four cubes unfolded and the graph representing their 
colors. 

three pairs of opposite faces by three edges, drawn between the vertices with 
appropriate colors. In other words, if a blue face in cube 1 has a white face 
opposite to it, draw an edge between vertices B and W in the graph. Do the 
same for the remaining two pairs of faces in cube 1. Put label 1 on all three 
edges resulting from cube 1. A self-loop with the edge labeled 1 at vertex R, 
for instance, would result if cube 1 had a pair of opposite faces both colored 



sec. 2-4 


WALKS, PATHS, AND CIRCUITS 


19 


red. Repeat the procedure for the other three cubes one by one on the same 
graph until we have a graph with four vertices and 12 edges. A particular set 
of four colored cubes and their graph are shown in Fig. 2-6. 

Step 2: Consider the graph resulting from this representation. The degree 
of each vertex is the total number of faces with the corresponding color. For 
the cubes of Fig. 2-6, we have five blue faces, six green, seven red, and six 
white. 

Consider two opposite vertical sides of the desired column of four cubes, 
say facing north and south. A subgraph (with four edges) will represent these 
eight faces—four facing south and four north. Each of the four edges in this 
subgraph will have a different label—1,2, 3, and 4. Moreover, no color occurs 
twice on either the north side or south side of the column if and only if every 
vertex in this subgraph is of degree two. 

Exactly the same argument applies to the other two sides, east and west, 
of the column. 

Thus the four cubes can be arranged (to form a column such that no color 
appears more than once on any side) if and only if there exist two edge-dis¬ 
joint subgraphs, each with four edges, each of the edges labeled differently, 
and such that each vertex is of degree two. For the set of cubes shown in Fig. 
2-6, this condition is satisfied, and the two subgraphs are shown in Fig. 2-7. 



(a) North-South Subgraph 


(b) East-West Subgraph 


Fig. 2-7 Two edge-disjoint subgraphs of the graph in Fig. 2-6. 


2-4. WALKS, PATHS, AND CIRCUITS 

A walk is defined as a finite alternating sequence of vertices and edges, 
beginning and ending with vertices, such that each edge is incident with the 
vertices preceding and following it. No edge appears (is covered or traversed) 
more than once in a walk. A vertex, however, may appear more than once. 
In Fig. 2-8(a), for instance, v l a v 2 b v 3 c v 3 d v A e v 2 f v 5 is a walk shown with 
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(a) An Open Walk (b) A Path of Length Three 

Fig. 2-8 A walk and a path. 

heavy lines. A walk is also referred to as an edge train or a chain. The set of 
vertices and edges constituting a given walk in a graph G is clearly a subgraph 
of G. 

Vertices with which a walk begins and ends are called its terminal vertices. 
Vertices v x and v 5 are the terminal vertices of the walk shown in Fig. 2-8(a). 
It is possible for a walk to begin and end at the same vertex. Such a walk is 
called a closed walk. A walk that is not closed (i.e., the terminal vertices are 
distinct) is called an open walk [Fig. 2-8(a)]. 

An open walk in which no vertex appears more than once is called a path 
(or a simple path or an elementary path). In Fig. 2-8, v x av 2 b v 3 dv 4 is a path, 
whereas v x a v 2 b v 3 c v 3 d e v 2 f v 5 is not a path. In other words, 
a path does not intersect itself. The number of edges in a path is called the 
length of a path. It immediately follows, then, that an edge which is not a self¬ 
loop is a path of length one. It should also be noted that a self-loop can be 
included in a walk but not in a path (Fig. 2-8). 

The terminal vertices of a path are of degree one, and the rest of the ver¬ 
tices (called intermediate vertices ) are of degree two. This degree, of course, 
is counted only with respect to the edges included in the path and not the 
entire graph in which the path may be contained. 

A closed walk in which no vertex (except the initial and the final vertex) 
appears more than once is called a circuit. That is, a circuit is a closed, non- 




Fig. 2-9 Three different circuits. 
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intersecting walk. In Fig. 2-8(a), v 2 b v 3 d v A ev 2 is, for example, a circuit. 
Three different circuits are shown in Fig. 2-9. Clearly, every vertex in a circuit 
is of degree two; again, if the circuit is a subgraph of another graph, one must 
count degrees contributed by the edges in the circuit only. 

A circuit is also called a cycle, elementary cycle, circular path, and polygon. 
In electrical engineering a circuit is sometimes referred to as a loop —not to 
be confused with self-loop. (Every self-loop is a circuit, but not every circuit 
is a self-loop.) 

The definitions in this section are summarized in Fig. 2-10. The arrows 
are in the direction of increasing restriction. 

You may have observed that although the concepts of a path and a cir¬ 
cuit are very simple, the formal definition becomes involved. 


A non-intersecting 
open walk in O' 



Fig. 2-10 Walks, paths, and circuits as subgraphs. 


2-5. CONNECTED GRAPHS, DISCONNECTED 
GRAPHS, AND COMPONENTS 

Intuitively, the concept of connectedness is obvious. A graph is connected 
if we can reach any vertex from any other vertex by traveling along the edges. 
More formally: 

A graph G is said to be connected if there is at least one path between every 
pair of vertices in G. Otherwise, G is disconnected. For instance, the graph in 
Fig. 2-8(a) is connected, but the one in Fig. 2-11 is disconnected. A null 
graph of more than one vertex is disconnected (Fig. 1-12). 

It is easy to see that a disconnected graph consists of two or more con¬ 
nected graphs. Each of these connected subgraphs is called a component. The 
graph in Fig. 2-11 consists of two components. Another way of looking at a 
component is as follows: Consider a vertex v. in a disconnected graph G. By 
definition, not all vertices of G are joined by paths to v r Vertex v t and all the 
vertices of G that have paths to v t , together with all the edges incident on 
them, form a component. Obviously, a component itself is a graph. 
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Fig. 2-11 A disconnected graph with 
two components. 


Theorem 2-1 


A graph G is disconnected if and only if its vertex set V can be partitioned into 
two nonempty, disjoint subsets V, and V 2 such that there exists no edge in G 
whose one end vertex is in subset V x and the other in subset V 2 . 


Proof: Suppose that such a partitioning exists. Consider two arbitrary vertices 
a and b of G, such that a e V\ and be V 2 . No path can exist between vertices 
a and b ; otherwise, there would be at least one edge whose one end vertex would 
be in V x and the other in V 2 . Hence, if a partition exists, G is not connected. 

Conversely, let G be a disconnected graph. Consider a vertex a in G. Let V x 
be the set of all vertices that are joined by paths to a. Since G is disconnected, 
V x does not include all vertices of G. The remaining vertices will form a (nonempty) 
set V 2 . No vertex in V x is joined to any in V 2 by an edge. Hence the partition. ■ 

Two interesting and useful results involving connectedness are: 

Theorem 2-2 

If a graph (connected or disconnected) has exactly two vertices of odd degree, 
there must be a path joining these two vertices. 

Proof: Let G be a graph with all even verticesf except vertices v { and v 2 , which 
are odd. From Theorem 1-1, which holds for every graph and therefore for every 
component of a disconnected graph, no graph can have an odd number of odd 
vertices. Therefore, in graph G, v x and v 2 must belong to the same component, 
and hence must have a path between them. ■ 


Theorem 2-3 

A simple graph (i.e., a graph without parallel edges or self-loops) with n vertices 
and k components can have at most (n — k)(n ----- k I l)/2 edges. 


Proof: Let the number of vertices in each of the k components of a graph G 
be n x , n 2 , . . . , n k . Thus we have 

n x + n 2 + • • • T = n, 
n t > 1 . 

f For brevity, a vertex with odd degree is called an odd vertex, and a vertex with even 
degree an even vertex. 
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The proof of the theorem depends on an algebraic inequalityt 

2 nf <n 2 -{k - 1)(2 n - k). (2-1) 

i- i 

Now the maximum number of edges in the /th component of G (which is a simple 
connected graph) is \,«,■(«,■ — 1). (See Problem 1-12.) Therefore, the maximum 
number of edges in G is 

y £ (»/ - IK = y (i if?) - y (2-2) 

< y[« 2 - (k - 1)(2« - A:)] - y. from (2-1) 

y.(» -*)(«-A: + 1). ■ (2-3) 

It may be noted that Theorem 2-3 is a generalization of the result in Problem 

1- 12. The solution to Problem 1-12 is given by (2-3), where k 0 ; 1. 

Now we are equipped to handle the Konigsberg bridge problem intro¬ 
duced in Chapter 1. 

2- 6. EULER GRAPHS 

As mentioned in Chapter 1, graph theory was born in 1736 with Euler’s 
famous paper in which he solved the Konigsberg bridge problem. In the same 
paper, Euler posed (and then solved) a more general problem: In what type 
of graph G is it possible to find a closed walk running through every edge of 
G exactly once ? Such a walk is now called an Euler line, and a graph that con¬ 
sists of an Euler line is called an Euler graph. More formally: 

If some closed walk in a graph contains all the edges of the graph, then the 
walk is called an Euler line and the graph an Euler graph. 

By its very definition a walk is always connected. Since the Euler line 
(which is a walk) contains all the edges of the graph, an Euler graph is always 
connected, except for any isolated vertices the graph may have. Since isolated 
vertices do not contribute anything to the understanding of an Euler graph, 
it is hereafter assumed that Euler graphs do not have any isolated vertices 
and are therefore connected. 

Now we shall state and prove an important theorem, which will enable us 
to tell immediately whether or not a given graph is an Euler graph. 

t Proof: 21 £= i K — 1) = n — k. Squaring both sides, 

^ 2 ( n > ~ 1)^ = n 1 + k 2 — Ink 

or Xf= i K 2 ~ 2/I,-) + k + nonnegative cross terms = n 2 + k 2 — 2 nk because (//,• — 1)>:0, 
for all /. Therefore, i n f < n 1 + k 1 — Ink — k + 2n — n 2 — (k — 1)(2« — k). ■ 
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Theorem 2-4 

A given connected graph G is an Euler graph if and only if all vertices of G are 
of even degree. 

Proof: Suppose that G is an Euler graph. It therefore contains an Euler line 
(which is a closed walk). In tracing this walk we observe that every time the walk 
meets a vertex v it goes through two “new” edges incident on v — with one we 
“entered” v and with the other “exited.” This is true not only of all intermediate 
vertices of the walk but also of the terminal vertex, because we “exited” and 
“entered” the same vertex at the beginning and end of the walk, respectively. Thus 
if G is an Euler graph, the degree of every vertex is even. 

To prove the sufficiency of the condition, assume that all vertices of G are of 
even degree. Now we construct a walk starting at an arbitrary vertex v and going 
through the edges of G such that no edge is traced more than once. We continue 
tracing as far as possible. Since every vertex is of even degree, we can exit from 
every vertex we enter; the tracing cannot stop at any vertex but v. And since v is 
also of even degree, we shall eventually reach v when the tracing comes to an end. 
If this closed walk h we just traced includes all the edges of G, G is an Euler graph. 
If not, we remove from G all the edges in h and obtain a subgraph ti of G formed 
by the remaining edges. Since both G and h have all their vertices of even degree, 
the degrees of the vertices of ti are also even. Moreover, ti must touch h at least 
at one vertex a, because G is connected. Starting from a, we can again construct 
a new walk in graph ti. Since all the vertices of ti are of even degree, this walk in 
ti must terminate at vertex a; but this walk in ti can be combined with h to form 
a new walk, which starts and ends at vertex v and has more edges than h. This 
process can be repeated until we obtain a closed walk that traverses all the edges 
of G. Thus G is an Euler graph. ■ 

Konigsberg Bridge Problem: Now looking at the graph of the Konigsberg 
bridges (Fig. 1-5), we find that not all its vertices are of even degree. Hence, 
it is not an Euler graph. Thus it is not possible to walk over each of the seven 
bridges exactly once and return to the starting point. 

One often encounters Euler lines in various puzzles. The problem common 
to these puzzles is to find how a given picture can be drawn in one continuous 
line without retracing and without lifting the pencil from the paper. Two such 
pictures are shown in Fig. 2-12. The drawing in Fig. 2-12(a) is called Moham¬ 
med's scimitars and is believed to have come from the Arabs. The one in Fig. 
2-12(b) is, of course, the star of David. (Equal time!) 

In defining an Euler line some authors drop the requirement that the walk 
be closed. For example, the walk a I c2d3a4b5d6elbin Fig. 2-13, which 
includes all the edges of the graph and does not retrace any edge, is not closed. 
The initial vertex is a and the final vertex is b. We shall call such an open 
walk that includes (or traces or covers) all edges of a graph without retracing 
any edge a unicursal line or an open Euler line. A (connected) graph that has 
a unicursal line will be called a unicursal graph. 
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Fig. 2-12 Two Euler graphs. 



Fig. 2-13 Unicursal graph. 


It is clear that by adding an edge between the initial and final vertices of 
a unicursal line we shall get an Euler line. Thus a connected graph is unicursal 
if and only if it has exactly two vertices of odd degree. This observation can 
be generalized as follows: 

Theorem 2-5 

In a connected graph G with exactly 2k odd vertices, there exist k edge-disjoint 
subgraphs such that they together contain all edges of G and that each is a unicursal 
graph. 

Proof: Let the odd vertices of the given graph G be named v u v 2 , . . . , v k ; 
w u w> 2 , . . . , w k in any arbitrary order. Add k edges to G between the vertex pairs 
b'l, Wi), 0'2, W 2 ), . . . , (v k , w*) to form a new graph G '. 

Since every vertex of G' is of even degree, G' consists of an Euler line p. Now 
if we remove from p the k edges we just added (no two of these edges are incident 
on the same vertex), p will be split into k walks, each of which is a unicursal line: 
The first removal will leave a single unicursal line; the second removal will split 
that into two unicursal lines; and each successive removal will split a unicursal 
line into two unicursal lines, until there are k of them. Thus the theorem. ■ 
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We shall interrupt our study of Euler graphs to define some commonly 
used graph-theoretic operations. One of these operations is required immedi¬ 
ately in the next section; others will be needed later. 

2-7. OPERATIONS ON GRAPHS 

As is the case with most mathematical entities, it is convenient to consider 
a large graph as a combination of small ones and to derive its properties 
from those of the small ones. Since graphs are defined in terms of the sets of 
vertices and edges, it is natural to employ the set-theoretical terminology to 
define operations between graphs. In particular: 

The union of two graphs G, = (K,, £j) and G 2 — (V 2 , E 2 ) is another 
graph G 3 (written as G 3 = Gj U G 2 ) whose vertex set V 3 = V x u V 2 and the 
edge set E 3 — E x u E 2 . Likewise, the intersection G x n G 2 of graphs G l and 
G 2 is a graph G 4 consisting only of those vertices and edges that are in both 
(7[ and G 2 . The ring sum of two graphs G, and G 2 (written as G, © G 2 ) is a 
graph consisting of the vertex set K, U V 2 and of edges that are either in G x 
or G 2 , but not in both. Two graphs and their union, intersection, and ring sum 
are shown in Fig. 2-14.t 

It is obvious from their definitions that the three operations just men¬ 
tioned are commutative. That is, 

G x u g 2 = g 2 u g„ Gj n g 2 = g 2 n G„ 
g,©g 2 = g 2 ©g,. 

If G x and G 2 are edge disjoint, then G, n G 2 is a null graph, and G, ©G 2 = 
G, U G 2 . If G, and G 2 are vertex disjoint, then G, n G 2 is empty. 

For any graph G, 

GuG = GnG = G, 

and _ 

G @ G = a null graph. 

Ifgis a subgraph of G, then G@g is, by definition, that subgraph of G which 
remains after all the edges in g have been removed from G. Therefore, G@g 
is written as G — g, whenever g <= G. Because of this complementary nature, 
G ® g = G — g is often called the complement of g in G. 

Decomposition: A graph G is said to have been decomposed into two sub¬ 
graphs g, and g 2 if 

gi U g 2 = G, 

aild g x n g 2 = a null graph. 


flf an edge e t is in two graphs G , and G 2 , its end vertices in C, must have the same labels 
as in Go. 
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(i | n a 2 g,©6' 2 

Fig. 2-14 Union, intersection, and ring sum of two graphs. 

In other words, every edge of G occurs either in g x or in g 2 , but not in both. 
Some of the vertices, however, may occur in both gj and g 2 . In decomposi¬ 
tion, isolated vertices are disregarded. A graph containing m edges [c,,c 2 , 
. . . , e m } can be decomposed in 2 m ~' — 1 different ways into pairs of sub¬ 
graphs g,, g 2 (why?). 

Although union, intersection, and ring sum have been defined for a pair 
of graphs, these definitions can be extended in an obvious way to include any 
finite number of graphs. Similarly, a graph G can be decomposed into more 
than two subgraphs—subgraphs that are (pairwise) edge disjoint and col¬ 
lectively include every edge in G. 

Deletion: If v t is a vertex in graph G , then G — v t denotes a subgraph of 
G obtained by deleting (i.e., removing) v t from G. Deletion of a vertex always 
implies the deletion of all edges incident on that vertex. (See Fig. 2-15.) Ifc y 
is an edge in G , then G — e } is a subgraph of G obtained by deleting <? y from 
G. Deletion of an edge does not imply deletion of its end vertices. Therefore 
G - e s = G 0 e s . 
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G 


(G - u,) 


(G - e f ) 


Fig. 2-15 Vertex deletion and edge deletion. 

Fusion: A pair of vertices a, b in a graph are said to be fused (merged or 
identified ) if the two vertices are replaced by a single new vertex such that every 
edge that was incident on either a or b or on both is incident on the new ver¬ 
tex. Thus fusion of two vertices does not alter the number of edges, but it 
reduces the number of vertices by one. See Fig. 2-16 for an example. 




Fig. 2-16 Fusion of vertices a and b. 

These are some of the elementary operations on graphs. More complex 
operations have been defined and are used in graph-theory literature. For a 
survey of such operations see the paper by Harary and Wilcox [2-10], 

2-8. MORE ON EULER GRAPHS 

The following are some more results on the important topic of Euler 
graphs. 

Theorem 2-6 

A connected graph G is an Euler graph if and only if it can be decomposed 
into circuits. 

Proof: Suppose graph G can be decomposed into circuits; that is, G is a union 
of edge-disjoint circuits. Since the degree of every vertex in a circuit is two, the 
degree of every vertex in G is even. Hence G is an Euler graph. 
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Conversely, let G be an Euler graph. Consider a vertex v x . There are at least 
two edges incident at v ,. Let one of these edges be between Vi and v 2 . Since vertex 
v 2 is also of even degree, it must have at least another edge, say between v 2 and 
v 3 . Proceeding in this fashion, we eventually arrive at a vertex that has previously 
been traversed, thus forming a circuit I\ Let us remove T from G. All vertices in 
the remaining graph (not necessarily connected) must also be of even degree. From 
the remaining graph remove another circuit in exactly the same way as we removed 
r from G. Continue this process until no edges are left. Hence the theorem. ■ 


Arbitrarily Traceable Graphs: Consider the graph in Fig. 2-17, which is 
an Euler graph. Suppose that we start from vertex a and trace the path a b c. 



Fig. 2-17 Arbitrarily traceable graph 
from c. 


Now at c we have the choice of going to a, d , or e. If we took the first choice, 
we would only trace the circuit a b c a, which is not an Euler line. Thus, start¬ 
ing from a , we cannot trace the entire Euleriine simply by moving along any 
edge that has not already been traversed. This raises the following interesting 
question: 

What property must a vertex v in an Euler graph have such that an Euler 
line is always obtained when one follows any walk from vertex v according to 
the single rule that whenever one arrives at a vertex one shall select any edge 
(which has not been previously traversed)? 

Such a graph is called an arbitrarily traceable graph from vertex v. For 
instance, the Euler graph in Fig. 2-17 is an arbitrarily traceable graph from 
vertex c, but not from any other vertex. The Euler graph in Fig. 2-18 is not 
arbitrarily traceable from any vertex; the graph in Fig. 2-19 is arbitrarily 



Fig. 2-18 Euler graph; not arbitrarily 
traceable. 



Fig. 2-19 Arbitrarily traceable graph 
from all vertices. 
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traceable from all its vertices. The following interesting theorem, due to Ore 
[2-5], answers the question just raised. 

Theorem 2-7 

An Euler graph G is arbitrarily traceable from vertex v in G if and only if every 
circuit in G contains v. 

For a proof of the theorem the reader is referred to [2-5]. 

2-9. HAMILTONIAN PATHS AND CIRCUITS 

An Euler line of a connected graph was characterized by the property of 
being a closed walk that traverses every edge of the graph (exactly once). A 
Hamiltonian circuit in a connected graph is defined as a closed walk that 
traverses every vertex of G exactly once, except of course the starting vertex, 
at which the walk also terminates. For example, in the graph of Fig. 2-20(a) 



Fig. 2-20 Hamiltonian circuits. 

starting at vertex v, if one traverses along the edges shown in heavy lines— 
passing through each vertex exactly once—one gets a Hamiltonian circuit. 
A Hamiltonian circuit for the graph in Fig. 2-20(b) is also shown by heavy 
lines. More formally: 

A circuit in a connected graph G is said to be Hamiltonian if it includes 
every vertex of G. Hence a Hamiltonian circuit in a graph of n vertices consists 
of exactly n edges. 

Obviously, not every connected graph has a Hamiltonian circuit. For 
example, neither of the graphs shown in Figs. 2-17 and 2-18 has a Hamil¬ 
tonian circuit. This raises the question: What is a necessary and sufficient 
condition for a connected graph G to have a Hamiltonian circuit? 
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Dodecahedron 




Fig. 2-21 Dodecahedron and its graph shown with a Hamiltonian 
circuit. 

This problem, first posed by the famous Irish mathematician Sir William 
Rowan Hamilton in 1859, is still unsolved. As was mentioned in Chapter I, 
Hamilton made a regular dodecahedron of wood [see Fig. 2-21(a)], each of 
whose 20 corners was marked with the name of a city. The puzzle was to 
start from any city and find a route along the edge of the dodecahedron that 
passes through every city exactly once and returns to the city of origin. The 
graph of the dodecahedron is given in Fig. 2-21(b), and one of many such 
routes (a Hamiltonian circuit) is shown by heavy lines. 

The resemblance between the problem of an Euler line and that of a 
Hamiltonian circuit is deceptive. The latter is infinitely more complex. Al¬ 
though one can find Hamiltonian circuits in many specific graphs, such as 
those shown in Figs. 2-20 and 2-21, there is no known criterion we can apply 
to determine the existence of a Hamiltonian circuit in general. There are, 
however, certain types of graphs that always contain Hamiltonian circuits,- 
as will be presently shown. 

Hamiltonian Path: If we remove any one edge from a Hamiltonian circuit, 
we are left with a path. This path is called a Hamiltonian path. Clearly, a 
Hamiltonian path in a graph G traverses every vertex of G. Since a Hamil¬ 
tonian path is a subgraph of a Hamiltonian circuit (which in turn is a sub¬ 
graph of another graph), every graph that has a Hamiltonian circuit also has 
a Hamiltonian path. There are, however, many graphs with Hamiltonian 
paths that have no Hamiltonian circuits (Problem 2-23). The length of a 
Hamiltonian path (if it exists) in a connected graph of n vertices is n — 1. 

In considering the existence of a Hamiltonian circuit (or path), we need 
only consider simple graphs. This is because a Hamiltonian circuit (or path) 
traverses every vertex exactly once. Hence it cannot include a self-loop or 
a set of parallel edges. Thus a general graph may be made simple by removing 
parallel edges and self-loops before looking for a Hamiltonian circuit in it. 

It is left as an exercise for the reader to show that neither of the two graphs 
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(a) (b) 

Fig. 2-22 Graphs without Hamiltonian circuits. 

shown in Fig. 2-22 has a Hamiltonian circuit (or Hamiltonian path). See 
Problem 2-24. 

What general class of graphs is guaranteed to have a Hamiltonian circuit? 
Complete graphs of three or more vertices constitute one such class. 

Complete Graph: A simple graph in which there exists an edge between 
every pair of vertices is called a complete graph. Complete graphs of two, 
three, four, and five vertices are shown in Fig. 2-23. A complete graph is 



Fig. 2-23 Complete graphs of two, three, four, and five vertices. 


sometimes also referred to as a universal graph or a clique. Since every vertex 
is joined with every other vertex through one edge, the degree of every vertex 
is n — 1 in a complete graph G of n vertices. Also the total number of edges 
in G is n(n — l)/2. See Problem 1-12. 

It is easy to construct a Hamiltonian circuit in a complete graph of n 
vertices. Let the vertices be numbered v { , v 2 ,... , v n . Since an edge exists 
between any two vertices, we can start from v i and traverse to v 2 , and v 3 , and 
so on to v n , and finally from v n to v v This is a Hamiltonian circuit. 

Number of Hamiltonian Circuits in a Graph: A given graph may contain 
more than one Hamiltonian circuit. Of interest are all the edge-disjoint 
Hamiltonian circuits in a graph. The determination of the exact number of 
edge-disjoint Hamiltonian circuits (or paths) in a graph in general is also an 
unsolved problem. However, the number of edge-disjoint Hamiltonian cir- 
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cuits in a complete graph with odd number of vertices is given by Theorem 
2 - 8 . 


Theorem 2-8 

In a complete graph with n vertices there are ( n — l)/2 edge-disjoint Hamil¬ 
tonian circuits, if n is an odd number > 3. 

Proof: A complete graph <7 of n vertices has n(n — l)/2 edges, and a Hamil¬ 
tonian circuit in G consists of n edges. Therefore, the number of edge-disjoint 
Hamiltonian circuits in G cannot exceed (n — l)/2. That there are (n — l)/2 edge- 
disjoint Hamiltonian circuits, when n is odd, can be shown as follows: 

The subgraph (of a complete graph of n vertices) in Fig. 2-24 is a Hamiltonian 
circuit. Keeping the vertices fixed on a circle, rotate the polygonal pattern clockwise 



a 3 


Fig. 2-24 Hamiltonian circuit; n is 
odd. 


by 360/(« - 1), 2- 360/(w - 1), 3-360/(n - 1), - 3)/2-360/(« - 1) degrees. 

Observe that each rotation produces a Hamiltonian circuit that has no edge in 
common with any of the previous ones. Thus we have (n — 3)/2 new Hamiltonian 
circuits, all edge disjoint from the one in Fig. 2-24 and also edge disjoint among 
themselves. Hence the theorem. ■ 


This theorem enables us to solve the problem of the seating arrangement 
at a round table, introduced in Chapter 1, as follows: 

Representing a member x by a vertex and the possibility of his sitting next 
to another member y by an edge between x and y, we construct a graph G. 
Since every member is allowed to sit next to any other member, G is a com¬ 
plete graph of nine vertices—nine being the number of people to be seated 
around the table. Every seating arrangement around the table is clearly a 
Hamiltonian circuit. 

The first day of their meeting they can sit in any order, and it will be a 
Hamiltonian circuit //,. The second day, if they are to sit such that every mem¬ 
ber must have different neighbors, we have to find another Hamiltonian cir¬ 
cuit H 2 in G, with an entirely different set of edges from those in H ,; that is, 
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//, and H 2 are edge-disjoint Hamiltonian circuits. From Theorem 2-8 the 
number of edge-disjoint Hamiltonian circuits in G is four; therefore, only 
four such arrangements exist among nine people. 

Another interesting result on the question of existence of Hamiltonian 
circuits in a graph, obtained by G. A. Dirac, is: 

Theorem 2-9 

A sufficient (but by no means necessary) condition for a simple graph G to 
have a Hamiltonian circuit is that the degree of every vertex in G be at least nf 2, 
where n is the number of vertices in G. 

Proof: For proof the reader is referred to the original paper by Dirac [2-3]. 


2-10. TRAVELING-SALESMAN PROBLEM 

A problem closely related to the question of Hamiltonian circuits is the 
traveling-salesman problem , stated as follows: A salesman is required to visit 
a number of cities during a trip. Given the distances between the cities, in 
what order should he travel so as to visit every city precisely once and return 
home, with the minimum mileage traveled? 

Representing the cities by vertices and the roads between them by edges, 
we get a graph. In this graph, with every edge e t there is associated a real 
number (the distance in miles, say), w(e,.). Such a graph is called a weighted 
graph ; w(c,) being the weight of edge e r 

In our problem, if each of the cities has a road to every other city, we have 
a complete weighted graph. This graph has numerous Hamiltonian circuits, 
and we are to pick the one that has the smallest sum of distances (or weights). 

The total number of different (not edge disjoint, of course) Hamiltonian 
circuits in a complete graph of n vertices can be shown to be (n — 1)1/2. This 
follows from the fact that starting from any vertex we have n — 1 edges to 
choose from the first vertex, n — 2 from the second, n — 3 from the third, 
and so on. These being independent choices, we get (n — 1)1 possible number 
of choices. This number is, however, divided by 2, because each Hamil¬ 
tonian circuit has been counted twice. 

Theoretically, the problem of the traveling salesman can always be solved 
by enumerating all (n — 1)1/2 Hamiltonian circuits, calculating the distance 
traveled in each, and then picking the shortest one. However, for a large 
value of n, the labor involved is too great even for a digital computer (try 
solving it for the 50 state capitals in the United States; n — 50). 

The problem is to prescribe a manageable algorithm for finding the short¬ 
est route. No efficient algorithm for problems of arbitrary size has yet been 
found, although many attempts have been made. Since this problem has 
applications in operations research, some specific large-scale examples have 
been worked out (see [2-1]). There are also available several heuristic methods 
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of solution that give a route very close to the shortest one, but do not guar¬ 
antee the shortest (see [2-4] for such a method). 


SUMMARY 

In this chapter we discussed the subgraph—a graph that is part of another 
graph. Walks, paths, circuits, Euler lines, Hamiltonian paths, and Hamil¬ 
tonian circuits in a graph G are its subgraphs with special properties. A given 
graph G can be characterized and studied in terms of the presence or absence 
of these subgraphs. Many physical problems can be represented by graphs 
and solved by observing the relevant properties of the corresponding graphs. 

Various types of walks discussed in this chapter are summarized in Fig. 
2-25. The arrows point in the direction of increasing restriction. 



Fig. 2-25 Different types of walks. 
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PROBLEMS 

2-1. Verify that the two graphs in Fig. 2-2 are isomorphic. Label the corresponding 
vertices and edges. 

2-2. Show by redrawing, step by step, that graphs (b) and (c) in Fig. 2-3 are isomorphic 
to (a). 

2-3. Show that the two graphs in Figs. 2-26(a) and (b) are isomorphic. 




(a) 


(b) 


Fig. 2-26 
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2-4. Construct three more examples to show that conditions 1,2, and 3 in Section 2-1 
are not sufficient for isomorphism between graphs. 

2-5. Prove that any two simple connected graphs with n vertices, all of degree two, are 
isomorphic. 

2-6. Are the two graphs in Fig. 2-27 isomorphic? Why? 



Fig. 2-27 


2-7. Given the set of cubes represented by the graph in Fig. 2-6, is it possible to stack 
all four cubes into a column such that each side shows only one color? Explain. 

2-8. Prove that a simple graph with n vertices must be connected if it has more than 
[(« — 1 ){n — 2)]/2 edges. {Hint: Use Theorem 2-3.) 

2-9. Prove that if a connected graph G is decomposed into two subgraphs gi and gz, 
there must be at least one vertex common between and gz- 

2-10. Prove that a connected graph G remains connected after removing an edge a from 
G, if and only if e,- is in some circuit in G. 

2-11. Draw a connected graph that becomes disconnected when any edge is removed 
from it. 

2-12. Prove that a graph with n vertices satisfying the condition of Problem 2-11 is (a) 
simple, and (b) has exactly n — 1 edges. 

2-13. What is the length of the path from the entrance to the center of the maze in 
Problem 1-7? 

2-14. List all the different paths between vertices 5 and 6 in Fig. 2-5(a). Give the length 
of each of these paths. 

2-15. Group the paths listed in Problem 2-14 into sets of edge-disjoint paths. Demon¬ 
strate that the union of two edge-disjoint paths between a pair of vertices forms a 
circuit. 

2-16. In a graph G let pi and pz be two different paths between two given vertices. Prove 
that pi © pz is a circuit or a set of circuits in G. 

2-17. Let a, b, and c be three distinct vertices in a graph. There is a path between a and 
b and also there is a path between b and c. Prove that there is a path between a and c. 

2-18. If the intersection of two paths is a disconnected graph, show that the union of the 
two paths has at least one circuit. 

2-19. You are given a 10-piece domino set whose titles have the following set of dots: 
(1,2); (1,3); (1,4); (1,5); (2,3); (2,4); (2,5); (3,4); (3,5); (4,5). Discuss the 
possibility of arranging the tiles in a connected series such that one number on a 
title always touches the same number on its neighbor. {Hint: Use a five-vertex 
complete graph and see if it is an Euler graph.) 

2-20. Is it possible to move a knight on a chessboard such that it completes every per- 
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missible move exactly once? A move between two squares is counted as one regard¬ 
less of the direction in which it is made. {Hint: Is the graph of Problem 1-6 uni- 
cursal ?) 

2-21. A round-robin tournament (when every player plays against every other) among 
n players {n being an even number) can be represented by a complete graph of n 
vertices. Discuss how you would schedule the tournaments to finish in the shortest 
possible time. 

2-22. Observe that there can be no path longer than a Hamiltonian path (if it exists) in 
a graph. 

2-23. Draw a graph that has a Hamiltonian path but does not have a Hamiltonian circuit. 

2-24. Show that neither of the graphs in Fig. 2-22 has a Hamiltonian path (and therefore 
no Hamiltonian circuit). [Hint: For Fig. 2-22(a), of all the edges incident at a vertex 
only two can be included in a Hamiltonian circuit. Count the number of edges 
that have to be excluded. You will find that 13 edges must be excluded from Fig. 
2-22(a). The number of remaining edges is insufficient to form a Hamiltonian circuit. 
For Fig. 2-22(b), first consider all vertices of degree two.] 

2-25. Show that the graph of a rhombic dodecahedron (with eight vertices of degree 
three and six vertices of degree four) has no Hamiltonian path (and therefore no 
Hamiltonian circuit). 

2-26. Draw a graph in which an Euler line is also a Hamiltonian circuit. What can you 
say about such graphs in general? 

2-27. Is it possible, starting from any of the 64 squares of the chessboard, to move a 
knight such that it occupies every square exactly once and returns to the initial 
position? If so, give one such tour. {Hint: Look for a Hamiltonian circuit in the 
graph of Problem 1-6.) 

2-28. Prove that a graph G with n vertices always has a Hamiltonian path if the sum of 
the degrees' of every pair of vertices v t , vj in G satisfies the condition 

d{vt) + d{vj)>n — 1. 

{Hint: First show that G is connected. Then use induction on path length in G.) 

2-29. Using the result of Problem 2-28, show that in a dancing ring of n children it is 
always possible to arrange the children so that everyone has a friend at each side if 
every child enjoys friendship with at least half the children. 
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discussed. 

The second part of the chapter introduces the spanning tree—another 
important notion in the theory of graphs. The relationships among circuits, 
trees, and so on, in a graph are explored. Unavoidably, as with Chapters 1 
and 2, this chapter also has a large number of definitions. In studying any new 
branch of mathematics, there is no way to avoid new terms and definitions. 


3-1. TREES 

A tree is a connected graph without any circuits. The graph in Fig. 3-1, 
for instance, is a tree. Trees with one, two, three, and four vertices are shown 
in Fig. 3-2. As pointed out in Chapter 1, a graph must have at least one vertex, 
and therefore so must a tree. Some authors allow the null tree , a tree without 
any vertices. We have excluded such an entity from being a tree. Similarly, 
as we are considering only finite graphs, our trees are also finite. 

It follows immediately from the definition that a tree has to be a simple 
graph, that is, having neither a self-loop nor parallel edges (because they both 
form circuits). 

Trees appear in numerous instances. The genealogy of a family is often 
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represented by means of a tree (in fact the term tree comes from family tree). 
A river with its tributaries and subtributaries can be represented by a tree. 
The sorting of mail according to zip code and the sorting of punched cards 
are done according to a tree (called decision tree or sorting tree). 

Figure 3-3 might represent the flow of mail. All the mail arrives at some 
local office, vertex N. The most significant digit in the zip code is read at N, 
and the mail is divided into 10 piles N u N 2 , . . . , N 9 , and N 0 , depending on 
the most significant digit. Each pile is further divided into 10 piles according 
to the second most significant digit, and so on, till the mail is subdivided into 
10 5 possible piles, each representing a unique five-digit zip code. 

In many sorting problems we have only two alternatives (instead of 10 as 
in the preceding example) at each intermediate vertex, representing a dicho¬ 
tomy, such as large or small, good or bad, 0 or 1. Such a decision tree with 
two choices at each vertex occurs frequently in computer programming and 
switching theory. We shall deal with such trees and their applications in Sec¬ 
tion 3-5. Let us first obtain a few simple but important theorems on the gene¬ 
ral properties of trees. 


3-2. SOME PROPERTIES OF TREES 

Theorem 3-1 

There is one and only one path between every pair of vertices in a tree, T. 

Proof: Since T is a connected graph, there must exist at least one path between 
every pair of vertices in T. Now suppose that between two vertices a and b of T 
there are two distinct paths. The union of these two paths will contain a circuit 
and T cannot be a tree. ■ 

Conversely: 

Theorem 3-2 

If in a graph G there is one and only one path between every pair of vertices, 
G is a tree. 

Proof: Existence of a path between every pair of vertices assures that G is 
connected. A circuit in a graph (with two or more vertices) implies that there is 
at least one pair of vertices a , b such that there are two distinct paths between a 
and b. Since G has one and only one path between every pair of vertices, G can 
have no circuit. Therefore, G is a tree. ■ 

Theorem 3-3 

A tree with n vertices has n — 1 edges. 

Proof: The theorem will be proved by induction on the number of vertices. 
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It is easy to see that the theorem is true for n = 1,2, and 3 (see Fig. 3-2). Assume 
that the theorem holds for all trees with fewer than n vertices. 

Let us now consider a tree T with n vertices. In T let e k be an edge with end 
vertices v ( and vj. According to Theorem 3-1, there is no other path between v t 
and vj except e k . Therefore, deletion of e k from T will disconnect the graph, as 
shown in Fig. 3-4. Furthermore, T — e k consists of exactly two components, and 
since there were no circuits in T to begin with, each of these components is a tree. 
Both these trees, t x and t 2 , have fewer than n vertices each, and therefore, by the 
induction hypothesis, each contains one less edge than the number of vertices in it. 
Thus T — e k consists of n — 2 edges (and n vertices). Hence T has exactly n — 1 
edges. ■ 

Theorem 3-4 

Any connected graph with n vertices and n — 1 edges is a tree. 

Proof: The proof of the theorem is left to the reader as an exercise (Problem 
3-5). 


You may have noticed another important feature of a tree: its vertices 
are connected together with the minimum number of edges. A connected 
graph is said to be minimally connected if removal of any one edge from it 
disconnects the graph. A minimally connected graph cannot have a circuit; 
otherwise, we could remove one of the edges in the circuit and still leave the 
graph connected. Thus a minimally connected graph is a tree. Conversely, if 
a connected graph G is not minimally connected, there must exist an edge 
e t in G such that G — e t is connected. Therefore, e,. is in some circuit, which 
implies that G is not a tree. Hence the following theorem: 

Theorem 3-5 

A graph is a tree if and only if it is minimally connected. 

The significance of Theorem 3-5 is obvious. Intuitively, one can see that 
to interconnect n distinct points, the minimum number of line segments 
needed is n — 1. It requires no background in electrical engineering to realize 
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#2 Fig. 3-5 Edge e added to G = gi U £ 2 . 

that to short (electrically) n pins together, one needs at least n — 1 pieces of 
wire. The resulting structure, according to Theorem 3-5, is a tree. 

We showed that a connected graph with n vertices and without any cir¬ 
cuits has n — 1 edges. We can also show that a graph with n vertices which 
has no circuit and has n — 1 edges is always connected (i.e., it is a tree), in 
the following theorem. 

Theorem 3-6 

A graph G with n vertices, n — 1 edges, and no circuits is connected. 

Proof: Suppose there exists a circuitless graph G with n vertices and n — 1 edges 
which is disconnected. In that case G will consist of two or more circuitless com¬ 
ponents. Without loss of generality, let G consist of two components, g i and g 2 . 
Add an edge e between a vertex v x in ^j and v 2 in g 2 (Fig. 3-5). Since there was no 
path between v x and v 2 in G, adding e did not create a circuit. Thus G u e is a cir¬ 
cuitless, connected graph (i.e., a tree) of n vertices and n edges, which is not possible, 
because of Theorem 3-3. ■ 

The results of the preceding six theorems can be summarized by saying 
that the following are five different but equivalent definitions of a tree. That is, 
a graph G with n vertices is called a tree if 

1. G is connected and is circuitless, or 

2. G is connected and has n — 1 edges, or 

3. G is circuitless and has n — 1 edges, or 

4. There is exactly one path between every pair of vertices in G, or 

5. G is a minimally connected graph. 

3-3. PENDANT VERTICES IN A TREE 

You must have observed that each of the trees shown in the figures has 
several pendant vertices (a pendant vertex was defined as a vertex of degree 
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Fig. 3-6 Tree of the monotonically increasing sequences in 4, 1, 
13, 7, 0, 2, 8, 11, 3. 


one). The reason is that in a tree of n vertices we have n — 1 edges, and hence 
2 (n — 1) degrees to be divided among n vertices. Since no vertex can be of 
zero degree, we must have at least two vertices of degree one in a tree. This 
of course makes sense only if n > 2. More formally: 

Theorem 3-7 

In any tree (with two or more vertices), there are at least two pendant vertices. 


An Application: The following problem is used in teaching computer 
programming. Given a sequence of integers, no two of which are the same, 
find the largest monotonically increasing subsequence in it. Suppose that the 
sequence given to us is 4, 1, 13, 7, 0, 2, 8, 11,3; it can be represented by a tree 
in which the vertices (except the start vertex) represent individual numbers 
in the sequence, and the path from the start vertex to a particular vertex v 
describes the monotonically increasing subsequence terminating in v. As 
shown in Fig. 3-6, this sequence contains four longest monotonically increas¬ 
ing subsequences, that is, (4, 7, 8, 11), (1, 7, 8, 11), (1, 2, 8, 11), and (0, 2, 8, 
11). Each is of length four. Such a tree used in representing data is referred 
to as a data tree by computer programmers. 
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3-4. DISTANCE AND CENTERS IN A TREE 

The tree in Fig. 3-7 has four vertices. Intuitively, it seems that vertex b is 
located more “centrally” than any of the other three vertices. We shall ex- 


d 


Fig. 3-7 Tree. 


plore this idea further and see if in a tree there exists a “center” (or centers). 
Inherent in the concept of a center is the idea of “distance,” so we must define 
distance before we can talk of a center. 

In a connected graph G, the distance d{v t , v } ) between two of its vertices 
v and Vj is the length of the shortest path (i.e., the number of edges in the 
shortest path) between them. 

The definition of distance between any two vertices is valid for any con¬ 
nected graph (not necessarily a tree). In a graph that is not a tree, there are 
generally several paths between a pair of vertices. We have to enumerate all 
these paths and find the length of the shortest one. (There may be several 
shortest paths.) 

For instance, some of the paths between vertices v x and v 2 in Fig. 3-8 are 
(a, e), (a, c,f), ( b , c, e ), (&,/), ( b , g, h ), and ( b , g , /, k). There are two shortest 
paths, (a, e ) and ( b,f ), each of length two. Hence d{v x , v 2 ) — 2. 

In a tree, since there is exactly one path between any two vertices (Theorem 
3-1), the determination of distance is much easier. For instance, in the tree of 
Fig. 3-7, d(a, b) = 1, d{a, c ) = 2, d(c, b) = 1, and so on. 

A Metric: Before we can legitimately call a function f(x, y) of two vari¬ 
ables a “distance” between them, this function must satisfy certain require¬ 
ments. These are 




Fig. 3-8 Distance between vi and vi is two. 
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1. Nonnegativity: f(x, > 0, and f(x, y) = 0 if and only if x = y. 

2. Symmetry: f(x, y) = f(y, x). 

3. Triangle inequality: f(x, j) <f(x, z) + /(z, j) for any z. 

A function that satisfies these three conditions is called a metric. That the 
distance d(y p Vj) between two vertices of a connected graph satisfies condi¬ 
tions 1 and 2 is immediately evident. Since d(v n v 3 ) is the length of the short¬ 
est path between vertices v t and v p this path cannot be longer than another 
path between v t and v p which goes through a specified vertex v k . Hence d{v ( , 
Vj) < d(v n v k ) + d(v k , Vj). Therefore, 


Theorem 3-8 

The distance between vertices of a connected graph is a metric. 

Coming back to our original topic of relative location of different vertices 
in a tree, let us define another term called eccentricity (also referred to as 
associated number or separation) of a vertex in a graph. 

The eccentricity E(v) of a vertex v in a graph G is the distance from v to 
the vertex farthest from v in G; that is, 

E(v) = max d(v, v t ). 

Vt€G 

A vertex with minimum eccentricity in graph G is called a center of G. The 
eccentricities of the four vertices in Fig. 3-7 are E(a) = 2, E(b) = 1, E(c) = 2, 
and E(d) = 2. Hence vertex b is the center of that tree. On the other hand, 
consider the tree in Fig. 3-9. The eccentricity of each of its six vertices is shown 
next to the vertex. This tree has two vertices having the same minimum 
eccentricity. Hence this tree has two centers. Some authors refer to such cen¬ 
ters as bicenters; we shall call them just centers, because there will be no 
occasion for confusion. 

The reader can easily verify that a graph, in general, has many centers. 
For example, in a graph that consists of just a circuit (a polygon), every vertex 
is a center. In the case of a tree, however, Konig [1-7] proved the following 
theorem: 

Theorem 3-9 

Every tree has either one or two centers. 



Fig 3-9 Eccentricities of the vertices of 
a tree. 
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• Center 
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(d) 

Fig. 3-10 Finding a center of a tree. 

Proof: The maximum distance, max d(v, v t ), from a given vertex v to any 
other vertex v ( occurs only when v t is a pendant vertex. With this observation, let 
us start with a tree T having more than two vertices. Tree T must have two or 
more pendant vertices (Theorem 3-7). Delete all the pendant vertices from T. The 
resulting graph T' is still a tree. What about the eccentricities of the vertices in T"! 
A little deliberation will reveal that removal of all pendant vertices from T uniformly 
reduced the eccentricities of the remaining vertices (i.e., vertices in T') by one. 
Therefore, all vertices that T had as centers will still remain centers in T'. From 
T' we can again remove all pendant vertices and get another tree T". We continue 
this process (which is illustrated in Fig. 3-10) until there is left either a vertex 
(which is the center of T ) or an edge (whose end vertices are the two centers of T). 
Thus the theorem. ■ 
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Corollary 

From the argument used in proving Theorem 3-9, we see that if a tree Thas two 
centers, the two centers must be adjacent. 

A Sociological Application: Suppose that the communication among a 
group of 14 persons in a society is represented by the graph in Fig. 3-10(a), 
where the vertices represent the persons and an edge represents the communi¬ 
cation link between its two end vertices. Since the graph is connected, we 
know that all the members can be reached by any member, either directly 
or through some other members. But it is also important to note that the 
graph is a tree—minimally connected. The group cannot afford to lose any 
of the communication links. 

The eccentricity of each vertex, E(v), represents how close v is to the farth¬ 
est member of the group. In Fig. 3-10(a), vertex c should be the leader of the 
group, if closeness of communication were the criterion for leadership. 

Radius and Diameter: If a tree has a center (or two centers), does it have 
a radius also? Yes. The eccentricity of a center (which is the distance from the 
center of the tree to the farthest vertex) in a tree is defined as the radius of the 
tree. For instance, the radius of the tree in Fig. 3-10(a) is three. The diameter 
of a tree T, on the other hand, is defined as the length of the longest path in 
T. It is left as an exercise for the reader (Problem 3-6) to show that a radius 
in a tree is not necessarily half its diameter. 


3-5. ROOTED AND BINARY TREES 

A tree in which one vertex (called the root ) is distinguished from all the 
others is called a rooted tree. For instance, in Fig. 3-3 vertex N, from where 
all the mail goes out, is distinguished from the rest of the vertices. Flence N 
can be considered the root of the tree, and so the tree is rooted. Similarly, in 
Fig. 3-6 the start vertex may be considered as the root of the tree shown. In 
a diagram of a rooted tree, the root is generally marked distinctly. We will 
show the root enclosed in a small triangle. All rooted trees with four vertices 
are shown in Fig. 3-11. Generally, the term tree means trees without any root. 
However, for emphasis they are sometimes called free trees (or nonrooted 
trees) to differentiate them from the rooted kind. 






Fig. 3-11 Rooted trees with four vertices. 
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Binary Trees: A special class of rooted trees, called binary rooted trees, is 
of particular interest, since they are extensively used in the study of computer 
search methods, binary identification problems, and variable-length binary 
codes. A binary tree is defined as a tree in which there is exactly one vertex of 
degree two, and each of the remaining vertices is of degree one or three (Fig. 
3-12). (Obviously, we are talking about trees with three or more vertices.) 
Since the vertex of degree two is distinct from all other vertices, this vertex 
serves as a root. Thus every binary tree is a rooted tree. Two properties of 
binary trees follow directly from the definition: 

1. The number of vertices n in a binary tree is always odd. This is because 
there is exactly one vertex of even degree, and the remaining n — 1 vertices 
are of odd degrees. Since from Theorem 1-1 the number of vertices of odd 
degrees is even, n — 1 is even. Hence n is odd. 

2. Let p be the number of pendant vertices in a binary tree T. Then 
n — p — 1 is the number of vertices of degree three. Therefore, the number 
of edges in T equals 

\[P + 3(« - p - 1) + 2] = n - 1; 


hence 



(3-1) 


A nonpendant vertex in a tree is called an internal vertex. It follows from 
Eq. (3-1) that the number of internal vertices in a binary tree is one less than 
the number of pendant vertices. In a binary tree a vertex v t is said to be at 
level l; if v i is at a distance of /,. from the root. Thus the root is at level 0. A 
13-vertex, four-level binary tree is shown in Fig. 3-12. The number of vertices 
at levels 1, 2, 3, and 4 are 2, 2, 4, and 4, respectively. 

One of the most straightforward applications of binary trees is in search 
procedures. Each vertex of a binary tree represents a test with two possible 



Level 4 


Fig. 3-12 A 13-vertex, 4-level binary tree. 
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outcomes. We start at the root, and the outcome of the test at the root sends 
us to one of the two vertices at the next level, where further tests are made, 
and so on. Reaching a specified pendant vertex (the goal of the search) termi¬ 
nates the search. For such a search procedure it is often important to con¬ 
struct a binary tree in which, for a given number of vertices n, the vertex 
farthest from the root is as close to the root as possible. Clearly, there can be 
only one vertex (the root) at level 0, at most two vertices at level 1, at most 
four vertices at level 2, and so on. Therefore, the maximum number of vertices 
possible in a &-level binary tree is 

2 ° + 2 1 + 2 2 + • • • + 2 k > n . 

The maximum level, / max , of any vertex in a binary tree is called the height 
of the tree. It is easy to see that the minimum possible height of an w-vertex 
binary tree is 

min / max = flog 2 (« + 1) - 11, (3-2) 

where \n] denotes the smallest integer greater than or equal to n. 

On the other hand, to construct a binary tree for a given n such that the 
farthest vertex is as far as possible from the root, we must have exactly two 
vertices at each level, except at the 0 level. Therefore, 

H — 1 

max / raax = — (3-3) 

For n = 11, binary trees realizing both these extremes are shown in Fig. 
3-13. 


Level 
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1 

2 

3 
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min/ rna* = r(log 2 12) - 11 
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mu x 
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(a) 


(b) 


Fig. 3-13 Two 11-vertex binary trees. 
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In analysis of algorithms we are generally interested in computing the sum 
of the levels of all pendant vertices. This quantity, known as the path length 
(or external path length) of a tree , can be defined as the sum of the path 
lengths from the root to all pendant vertices. The path length of the binary 
tree in Fig. 3-12, for example, is 

1 + 3 + 3 + 4 + 4 + 4 + 4-23. 

The path lengths of trees in Figs. 3-13(a) and (b) are 16 and 20, respectively. 
The importance of the path length of a tree lies in the fact that this quantity 
is often directly related to the execution time of an algorithm. 

It can be shown that the type of binary tree in Fig. 3-13(a) (i.e., a tree with 
vertices at level / ma * — 1) yields the minimum path length for a given 
n. 

Weighted Path Length: In some applications, every pendant vertex v } of 
a binary tree has associated with it a positive real number Wj. Given w,, 
w 2 , . . ., w m the problem is to construct a binary tree (with m pendant ver¬ 
tices) that minimizes 

2 Wy/y, 

where /,. is the level of pendant vertex v jf and the sum is taken over all pendant 
vertices. Let us illustrate the significance of this problem with a simple exam¬ 
ple. 

A Coke machine is to identify, by a sequence of tests, the coin that is put 
into the machine. Only pennies, nickels, dimes, and quarters can go through 
the slot. Let us assume that the probabilities of a coin being a penny, a nickel, 
a dime, and a quarter are .05, .15, .5, and .30, respectively. Each test has the 
effect of partitioning the four types of coins into two complementary sets and 
asserting the unknown coin to be in one of the two sets. Thus for four coins 
we have 2 3 — 1 such tests. If the time taken for each test is the same, what 
sequence of tests will minimize the expected time taken by the Coke machine 
to identify the coin? 

The solution requires the construction of a binary tree with four pendant 
vertices (and therefore three internal vertices) v u v 2 , v 3 , and v 4 and corre¬ 
sponding weights w, = .05, w 2 = .15, w 3 = .5, and w 4 = .3, such that the 
quantity ^ h w i * s minimized. The solution is given in Fig. 3-14(a), for which 
the expected time is 1 .It, where t is the time taken for each test. Contrast this 
with Fig. 3-14(b), for which the expected time is 2 1. An algorithm for con¬ 
structing a binary tree with minimum weighted path length can be found in 
[3-6]. 

In this problem of a Coke machine, many interesting variations are pos¬ 
sible. For example, not all possible tests may be available, or they may not all 
consume the same time. 

Binary trees with minimum weighted path length have also been used in 
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Z w, • /, = 1.7 Z w, • /, = 2 

(a) (b) 

Fig. 3-14 Two binary trees with weighted pendant vertices. 

constructing variable-length binary codes, where the letters of the alphabet 
(A, B, C, . . . , Z) are represented by binary digits. Since different letters have 
different frequencies of occurrence (frequencies are interpreted as weights 
w,, w 2 ,. . . , w 26 ), a binary tree with minimum weighted path length corre¬ 
sponds to a binary code of minimum cost; see [3-6]. For more on minimum- 
path binary trees and their applications the reader is referred to [3-5] and 
[3-7]. 


3-6. ON COUNTING TREES 

In 1857, Arthur Cayley discovered trees while he was trying to count the 
number of structural isomers of the saturated hydrocarbons (or paraffin 
series) C fc H 2/t+2 . He used a connected graph to represent the C k H 2k+2 mole¬ 
cule. Corresponding to their chemical valencies, a carbon atom was repre¬ 
sented by a vertex of degree four and a hydrogen atom by a vertex of degree 
one (pendant vertices). The total number of vertices in such a graph is 

n = 3k + 2, 

and the total number of edges is 

e = -^-(sum of degrees) = -^-(4 k + 2k + 2) 

= 3k + 1. 

Since the graph is connected and the number of edges is one less than the 
number of vertices, it is a tree. Thus the problem of counting structural 
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isomers of a given hydrocarbon becomes the problem of counting trees (with 
certain qualifying properties, to be sure). 

The first question Cayley asked was: what is the number of different trees 
that one can construct with n distinct (or labeled) vertices? If n = 4, for 
instance, we have 16 trees, as shown in Fig. 3-15. The reader can satisfy him¬ 
self that there are no more trees of four vertices. (Of course, some of these 
trees are isomorphic—to be discussed later.) 

A graph in which each vertex is assigned a unique name or label (i.e., no 
two vertices have the same label), as in Fig. 3-15, is called a labeled graph. 
The distinction between a labeled and an unlabeled graph is very important 
when we are counting the number of different graphs. For instance, the four 
graphs in the first row in Fig. 3-15 are counted as four different trees (even 
though they are isomorphic) only because the vertices are labeled. If there 



Fig. 3-15 All 16 trees of four labeled vertices, 
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were no distinction made between A, B, C, or D, these four trees would be 
counted as one. A careful inspection of the graphs in Fig. 3-15 will reveal that 
the number of unlabeled trees with four vertices (no distinction made between 
A, B, C, and D ) is only two. But first we shall continue with counting labeled 
trees. 

The following well-known theorem for counting trees was first stated and 
proved by Cayley, and is therefore called Cayley’s theorem. 

Theorem 3-10 

The number of labeled trees with n vertices (n > 2) is n n ~ 2 . 

Proof: The result was first stated and proved by Cayley. Many different proofs 
with various approaches (all somewhat involved) have been published since. An 
excellent summary of 10 such proofs is given by Moon [3-9]. We will give one 
proof in Chapter 10. 

Unlabeled Trees: In the actual counting of isomers of C k W 2k+2 , Theorem 
3-10 is not enough. In addition to the constraints on the degree of the vertices, 
two observations should be made: 

1. Since the vertices representing hydrogen are pendant, they go with 
carbon atoms only one way, and hence make no contribution to isomerism. 
Therefore, we need not show any hydrogen vertices. 

2. Thus the tree representing C fc H 2fc+2 reduces to one with k vertices, 
each representing a carbon atom. In this tree no distinction can be made 
between vertices, and therefore it is unlabeled. 

Thus for butane, C 4 H 10 , there are only two distinct trees (Fig. 3-16). As 
every organic chemist knows, there are indeed exactly two different types of 
butanes: w-butane and isobutane. It may be noted in passing that the four 
trees in the first row of Fig. 3-15 are isomorphic to the one in Fig. 3-16(a); 
and the other 12 are isomorphic to Fig. 3-16(b). 



Fig. 3-16 All trees of four unlabeled vertices. 
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The problem of counting trees of different types will be taken up again 
and discussed more thoroughly in Chapter 10. 


3-7. SPANNING TREES 

So far we have discussed the tree and its properties when it occurs as a 
graph by itself. Now we shall study the tree as a subgraph of another graph. 
A given graph has numerous subgraphs—from e edges, 2 e distinct combina¬ 
tions are possible. Obviously, some of these subgraphs will be trees. Out of 
these trees we are particularly interested in certain types of trees, called 
spanning trees —as defined next. 

A tree T is said to be a spanning tree of a connected graph G if T is a 
subgraph of G and T contains all vertices of G. For instance, the subgraph in 
heavy lines in Fig. 3-17 is a spanning tree of the graph shown. 

Since the vertices of G are barely hanging together in a spanning tree, it is 
a sort of skeleton of the original graph G. This is why a spanning tree is some¬ 
times referred to as a skeleton or scaffolding of G. Since spanning trees are the 
largest (with maximum number of edges) trees among all trees in G, it is also 
quite appropriate to call a spanning tree a maximal tree subgraph or maximal 
tree of G. 

It is to be noted that a spanning tree is defined only for a connected graph, 
because a tree is always connected, and in a disconnected graph of n vertices 
we cannot find a connected subgraph with n vertices. Each component (which 
by definition is connected) of a disconnected graph, however, does have a 
spanning tree. Thus a disconnected graph with k components has a spanning 
forest consisting of k spanning trees. (A collection of trees is called a forest .) 

Finding a spanning tree of a connected graph G is simple. If G has no cir¬ 
cuit, it is its own spanning tree. If G has a circuit, delete an edge from the 
circuit. This will still leave the graph connected (Problem 2-10). If there are 
more circuits, repeat the operation till an edge from the last circuit is delet¬ 
ed—leaving a connected, circuit-free graph that contains all the vertices of G. 
Thus we have 



Fig. 3-17 Spanning tree. 
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Theorem 3-11 

Every connected graph has at least one spanning tree. 

An edge in a spanning tree T is called a branch of T. An edge of G that is not 
in a given spanning tree T is called a chord. In electrical engineering a chord is 
sometimes referred to as a tie or a link. For instance,edges b x , b 2 , b 3 , b A , b 5 , 
and b 6 are branches of the spanning tree shown in Fig. 3-17, while edges 
c t , c 2 , c 3 , c 4 , c 5 , c 6 , c 7 , and c 8 are chords. It must be kept in mind that bran¬ 
ches and chords are defined only with respect to a given spanning tree. An 
edge that is a branch of one spanning tree T { (in a graph G ) may be a chord 
with respect to another spanning tree T 2 . 

It is sometimes convenient to consider a connected graph G as a union of 
two subgraphs, T and f \ that is, 

T U f = G, 

where T is a spanning tree, and f is the complement of T in G. Since the sub¬ 
graph f is the collection of chords, it is quite appropriately referred to as the 
chord set (or tie set or cotree) of T. From the definition, and from Theorem 
3-3, the following theorem is evident. 

Theorem 3-12 

With respect to any of its spanning trees, a connected graph of n vertices and 
e edges has n — 1 tree branches and e — n + 1 chords. 

For example, the graph in Fig. 3-17 (with n -= 7, e = 14), has six tree 
branches and eight chords with respect to the spanning tree {b x , b 2 , b 3 , b A , b 5 , 
b 6 }. Any other spanning tree will yield the same numbers. 

If we have an electric network with e elements (edges) and n nodes (ver¬ 
tices), what is the minimum number of elements we must remove to eliminate 
all circuits in the network? The answer is e — n -f 1. Similarly, if we have a 
farm consisting of six walled plots of land, as shown in Fig. 3-18, and these 
plots are full of water, how many walls will have to be broken so that all the 
water can be drained out? Here n = 10 and e 15. We shall have to select 



Fig. 3-18 Farm with walled plots of 
land. 



sec. 3-8 


FUNDAMENTAL CIRCUITS 


57 


a set of six (15 — 10 -f 1 =6) walls such that the remaining nine constitute 
a spanning tree. Breaking these six walls will drain the water out. 

Rank and Nullity: When someone specifies a graph G, the first thing he is 
most likely to mention is n, the number of vertices in G. Immediately follow¬ 
ing comes e, the number of edges in G. Then k, the number of components G 
has. If k = 1, G is connected. How are these three numbers of a graph relat¬ 
ed ? Since every component of a graph must have at least one vertex, « > k. 
Moreoever, the number of edges in a component can be no less than the num¬ 
ber of vertices in that component minus one. Therefore, e > n — k. Apart 
from the constraints n — k > 0 and e — n + k > 0, these three numbers 
n, e, and k are independent, and they are fundamental numbers in graphs. 
(Needless to mention, these numbers alone are not enough to specify a graph, 
except for trivial cases.) 

From these three numbers are derived two other important numbers 
called rank and nullity , defined as 

rank r — n — k, 
nullity fi = e — n + k. 

The rank of a connected graph is n — 1, and the nullity, e — n + 1. Although 
the real significance of these numbers will be clear in Chapter 7, it may be 
observed here that 

rank of G = number of branches in any spanning 
tree (or forest) of G, 
nullity of G = number of chords in G, 
rank + nullity = number of edges in G. 

The nullity of a graph is also referred to as its cyclomatic number, or first 
Betti number. 


3-8. FUNDAMENTAL CIRCUITS 

You may have noticed that if we add an edge between any two vertices of 
a tree (say, in Fig. 3-1) a circuit is created. This is because there already exists 
one path between any two vertices of a tree; adding an edge between them 
creates an additional path, and hence a circuit. Along this line of reasoning, 
it is not difficult to prove 

Theorem 3-13 

A connected graph G is a tree if and only if adding an edge between any two 
vertices in G creates exactly one circuit. 
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Let us now consider a spanning tree T in a connected graph G. Adding 
any one chord to T will create exactly one circuit. Such a circuit, formed by 
adding a chord to a spanning tree, is called a fundamental circuit. 

How many fundamental circuits does a graph have? Exactly as many as 
the number of chords, ja (= e — n 4- k). How many circuits does a graph 
have in all ? We know that one circuit is created by adding any one chord to a 
tree. Suppose that we add one more chord. Will it create exactly one more 
circuit? What happens if we add all the chords simultaneously to the tree? 

Let us look at the tree { b ,, b 2 , b 3 , Z> 4 , b 5 , b 6 } in Fig. 3-17. Adding c x to it, 
we get a subgraph {b x , b 2 , b 3 , b 4 , b 5 , b 6 , c,), which has one circuit (fundamen¬ 
tal circuit), {b x , b 2 , b 3 , b 5 , c,}. Had we added the chord c 2 (instead of c,) to the 
tree, we would have obtained a different fundamental circuit, {b 2 , b 3 , b 5 , c 2 }. 
Now suppose that we add both chords c x and c 2 to the tree. The subgraph 
{b x , b 2 , b 3 , Z? 4 , b 5 , b 6 , c x , c 2 ] has not only the fundamental circuits we just 
mentioned, but it has also a third circuit, {,b x , c x , c 2 ), which is not a funda¬ 
mental circuit. Although there are 75 circuits in Fig. 3-17 (enumerated by 
computer), only eight are fundamental circuits, each formed by one chord 
(together with the tree branches). 

Two comments may be appropriate here. First, a circuit is a fundamental 
circuit only with respect to a given spanning tree. A given circuit may be fun¬ 
damental with respect to one spanning tree, but not with respect to a different 
spanning tree of the same graph. Although the number of fundamental 
circuits (as well as the total number of circuits) in a graph is fixed, the cir¬ 
cuits that become fundamental change with the spanning trees. 

Second, in most applications we are not interested in all the circuits of 
a graph, but only in a set of fundamental circuits, which fortuitously are 
a lot easier to track. The concept of a fundamental circuit, introduced by 
Kirchhoff, is of enormous significance in electrical network analysis. What 
Kirchhoff showed, which now every sophomore in electrical engineering 
knows, is that no matter how many circuits a network contains we need con¬ 
sider only fundamental circuits with respect to any spanning tree. The rest 
of the circuits (as we shall prove rigorously in Chapter 7) are combinations of 
some fundamental circuits. 


3-9. FINDING ALL SPANNING TREES OF A GRAPH 

Usually, in a given connected graph there are a large number of spanning 
trees. In many applications we require all spanning trees. One reasonable 
way to generate spanning trees of a graph is to start with a given spanning 
tree, say tree T x {abed in Fig. 3-19). Add a chord, say h, to the tree T x . This 
forms a fundamental circuit {be hd in Fig. 3-19). Removal of any branch, 
say c, from the fundamental circuit b c h d just formed will create a new 



sec. 3-9 


FINDING ALL SPANNING TREES OF A GRAPH 


59 



7J T 2 T, 

Fig. 3-19 Graph and three of its spanning trees. 


spanning tree T t . This generation of one spanning tree from another, through 
addition of a chord and deletion of an appropriate branch, is called a cyclic 
interchange or elementary tree transformation. (Such a transformation is a 
standard operation in the iteration sequence for solving certain transporta¬ 
tion problems.) 

In the above procedure, instead of deleting branch c, we could have de¬ 
leted d or b and thus would have obtained two additional spanning trees 
a b c h and a c h d. Moreover, after generating these three trees, each with 
chord h in it, we can restart with T x and add a different chord ( e,f or g) and 
repeat the process of obtaining a different spanning tree each time a branch 
is deleted from the fundamental circuit formed. Thus we have a procedure 
for generating spanning trees for any given graph. 

As we shall see in Chapter 13, the topological analysis of a linear elec¬ 
trical network essentially reduces to the generation of trees in the correspond¬ 
ing graph. Therefore, finding an efficient procedure for generating all trees 
of a graph is a very important practical problem. 

The procedure outlined above raises many questions. Can we start from 
any spanning tree and get a desired spanning tree by a number of cyclic 
exchanges ? Can we get all spanning trees of a given graph in this fashion? 
How long will we have to continue exchanging edges? Out of all possible 
spanning trees that we can start with, is there a preferred one for starting? 
Let us try to answer some of these questions; others will have to wait until 
Chapters 7, 10, and 11. 

The distance between two spanning trees T t and T } of a graph G is defined 
as the number of edges of G present in one tree but not in the other. This 
distance may be written as d(T t , Tj). For instance, in Fig. 3-19 d(T 2 , T 3 ) = 3. 

Let T. @ Tj be the ring sum of two spanning trees T t and Tj of G (as 
defined in Chapter 2, T t @ T } is the subgraph of G containing all edges of 
G that are either in T. or in 7) but not in both). Let N(g) denote the number 
of edges in a graph g. Then, from definition, 

d(T„ Tj) = \n(T, © Tj). 
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It is not difficult to see that the number d(T n T } ) is the minimum number of 
cyclic interchanges involved in going from T t to T } . The reader is encouraged 
to prove the following two theorems. 

Theorem 3-14 

The distance between the spanning trees of a graph is a metric. That is, it satisfies 

d(T„ Tj) > 0 and d(T h Tj) - 0 if and only if 7) = T h 

d(T h Tj) = d(Tj, Tj), 

d(T h Tj) < d(T h T k ) 4 d(T k , Tj). 


Theorem 3-15 

Starting from any spanning tree of a graph G, we can obtain every spanning 
tree of G by successive cyclic exchanges. 

Since in a connected graph G of rank r (i.e., of r + 1 vertices) a spanning 
tree has r edges, we have the following results: 

The maximum distance between any two spanning trees in G is 

max d(T n Tj ) = -y max N(T, © Tj) 

< r, the rank of G. 

Also, if fj, is the nullity of G, we know that no more than p edges of a span¬ 
ning tree 7) can be replaced to get another tree T). 

Hence max d(T., Tj) < li \ 

combining the two, 

max <7(7), Tj) < min (jli, r), 

where min(//, r) is the smaller of the two numbers fi and r of the graph G. 
Central Tree: For a spanning tree T 0 of a graph G , let max d(T 0 , Tj) 

i 

denote the maximal distance between T 0 and any other spanning tree of G. 
Then T 0 is called a central tree of G if 

max d(T 0 , Tj) < max d(T, Tj) for every tree T of G. 

i j 

The concept of a central tree is useful in enumerating all trees of a given 
graph. A central tree in a graph is, in general, not unique. For more on cen¬ 
tral trees the reader should see [3-1] and [3-4]. 

Tree Graph: The tree graph of a given graph G is defined as a graph in 
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which each vertex corresponds to a spanning tree of G, and each edge cor¬ 
responds to a cyclic interchange between the spanning trees of G represented 
by the two end vertices of the edge. From Theorem 3-15 we know that start¬ 
ing from any spanning tree we can obtain all other spanning trees through 
cyclic interchanges (or elementary tree transformations). Therefore, the tree 
graph of any given (finite, connected) graph is connected. For additional 
properties of tree graphs, the reader should see [3-3]. 


3-10. SPANNING TREES IN A WEIGHTED GRAPH 

As discussed earlier in this chapter, a spanning tree in a graph G is a 
minimal subgraph connecting all the vertices of G. If graph G is a weighted 
graph (i.e., if there is a real number associated with each edge of G ), then 
the weight of a spanning tree T of G is defined as the sum of the weights of 
all the branches in T. In general, different spanning trees of G will have 
different weights. Among all the spanning trees of G, one with the smallest 
weight is of practical significance. (There may be several spanning trees with 
the smallest weight; for instance, in a graph of n vertices in which every edge 
has unit weight, all spanning trees have a weight of n — 1 units.) A spanning 
tree with the smallest weight in a weighted graph is called a shortest spanning 
tree or shortest-distance spanning tree or minimal spanning tree. 

One possible application of the shortest spanning tree is as follows: Sup¬ 
pose that we are to connect n cities v i ,v 2 , . . . , v n through a network of roads. 
The cost c u of building a direct road between v ( and v } is given for all pairs of 
cities where roads can be built. (There may be pairs of cities between which 
no direct road can be built.) The problem is then to find the least expensive 
network that connects all n cities together. It is immediately evident that this 
connected network must be a tree: otherwise, we can always remove some 
edges and get a connected graph with smaller weight. Thus the problem of 
connecting n cities with a least expensive network is the problem of finding 
a shortest spanning tree in a connected weighted graph of n vertices. A neces¬ 
sary and sufficient condition for a spanning tree to be shortest is given by 

Theorem 3-16 

A spanning tree T(of a given weighted connected graph G) is a shortest spanning 
tree (of G) if and only if there exists no other spanning tree (of G) at a distance of 
one from T whose weight is smaller than that of T. 

Proof: The necessary or the “only if” condition is obvious; otherwise, we shall 
get another tree shorter than T by a cyclic interchange. The fact that this condition 
is also sufficient is remarkable and is not obvious. It can be proved as follows: 

Let T x be a spanning tree in G satisfying the hypothesis of the theorem (i.e., 
there is no spanning tree at a distance of one from T x which is shorter than Ti). 
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The proof will be completed by showing that if T 2 is a shortest spanning tree (dif¬ 
ferent from T x ) in G, the weight of T x will also be equal to that of T 2 . Let T 2 be 
a shortest spanning tree in G. Clearly, T 2 must also satisfy the hypothesis of the 
theorem (otherwise there will be a spanning tree shorter than T 2 at a distance of 
one from T 2y violating the assumption that T 2 is shortest). 

Consider an edge e in T 2 which is not in T x . Adding etoTi forms a fundamental 
circuit with branches in T x . Some, but not all, of the branches in T x that form the 
fundamental circuit with e may also be in T 2 ; each of these branches in T x has 
a weight smaller than or equal to that of e, because of the assumption on T x . 
Amongst all those edges in this circuit which are not in T 2 at least one, say b if must 
form some fundamental circuit (with respect to T 2 ) containing e. Because of the 
minimality assumption on T 2 weight of b j cannot be less than that of e. Therefore 
bj must have the same weight as e. Hence the spanning tree T[ — (T x \Je — b j), 
obtained from T x through one cycle exchange, has the same weight as T x . But 
T x has one edge more in common with T 2 , and it satisfies the condition of Theorem 
3-16. This argument can be repeated, producing a series of trees of equal weight, 
T x , T x , T x , . . . , each a unit distance closer to T 2 , until we get T 2 itself. 

This proves that if none of the spanning trees at a unit distance from Tis shorter 
than T, no spanning tree shorter than T exists in the graph. ■ 

Algorithm for Shortest Spanning Tree: There are several methods available 
for actually finding a shortest spanning tree in a given graph, both by hand 
and by computer. One algorithm due to Kruskal [3-8] is as follows: List all 
edges of the graph G in order of nondecreasing weight. Next, select a smallest 
edge of G. Then for each successive step select (from all remaining edges of 
G) another smallest edge that makes no circuit with the previously selected 
edges. Continue until n — 1 edges have been selected, and these edges will 
constitute the desired shortest spanning tree. The validity of the method 
follows from Theorem 3-16. 

Another algorithm, which does not require listing all edges in order of 
nondecreasing weight or checking at each step if a newly selected edge forms 
a circuit, is due to Prim [3-10]. For Prim’s algorithm, draw n isolated vertices 
and label them v,, v 2 , . . . , v n . Tabulate the given weights of the edges of G 
in an n by n table. (Note that the entries in the table are symmetric with re¬ 
spect to the diagonal, and the diagonal is empty.) Set the weights of non¬ 
existent edges (corresponding to those pairs of cities between which no direct 
road can be built) as very large. 

Start from vertex and connect it to its nearest neighbor (i.e., to the 
vertex which has the smallest entry in row 1 of the table), say v k . Now con¬ 
sider and v k as one subgraph, and connect this subgraph to its closest 
neighbor (i.e., to a vertex other than v x and v k that has the smallest entry 
among all entries in rows 1 and k ). Let this new vertex be v r Next regard the 
tree with vertices v x , v k , and v. as one subgraph, and continue the process 
until all n vertices have been connected by n — 1 edges. Let us now illustrate 
this method of finding a shortest spanning tree. 
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(a) (b) 

Fig. 3-20 Shortest spanning tree in a weighted graph. 


A connected weighted graph with 6 vertices and 12 edges is shown in Fig. 
3-20(a). The weight of its edges is tabulated in Fig. 3-20(b). We start with v { 
and pick the smallest entry in row 1, which is either (v,, v 2 ) or (v x , v 5 ). Let us 
pick (v t ,v 5 ). [Had we picked (v lt v 2 ) we would have obtained a different 
shortest tree with the same weight.] The closest neighbor of subgraph (v,, v 5 ) 
is v 4 , as can be seen by examining all the entries in rows 1 and 5. The three re¬ 
maining edges selected following the above procedure turn out to be (v 4 , v 6 ), 
(v 4 , v 3 ), and (v 3 , v 2 ) in that sequence. The resulting tree—a shortest spanning 
tree—is shown in Fig. 3-20(a) in heavy lines. The weight of this tree is 41.5 
units. 

Degree-Constrained Shortest Spanning Tree: In a shortest spanning tree 
resulting from the preceding construction, a vertex v t can end up with any 
degree; that is, 1 < d(v t ) < n — 1. In some practical cases an upper limit on 
the degree of every vertex (of the resulting spanning tree) has to be imposed. 
For instance, in an electrical wiring problem, one may be required to wire 
together n pins (using as little wire as possible) with no more than three wires 
wrapped around any individual pin. Thus, in this particular case, 

d(v^) <3 for every v t . 

Such a spanning tree is called a degree-constrained shortest spanning tree. 

In general, the problem may be stated as follows: Given a weighted con¬ 
nected graph G, find a shortest spanning tree TinG such that 

d{v) < k for every vertex v ; in T. 

If k = 2, this problem, in fact, reduces to the problem of finding the shortest 
Hamiltonian path, as well as the traveling-salesman problem (without the 
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salesman returning to his home base), discussed at the end of Chapter 2. So 
far, no efficient method of finding an arbitrarily degree-constrained shortest 
spanning tree has been found. 


SUMMARY 

This chapter dealt with a particular type of connected graph called a tree. 
Because of their wide applications, trees form the most important topic in 
graph theory. Different types of trees, such as labeled and unlabeled, rooted 
and unrooted, were discussed, together with their properties and applications. 

Of special interest are those trees that are subgraphs of a given connected 
graph G containing all vertices of G. Such trees are called spanning trees of 
G. Finding all spanning trees of a given graph is of great practical importance, 
and so is the problem of finding a shortest spanning tree in a given weighted 
graph. 

Other related concepts, such as centers, radius, and diameter of a tree, 
rank and nullity of a graph, fundamental circuits, branches and chords, 
cyclic interchange, distance between spanning trees, and tree graphs, were 
also introduced and studied. Trees, spanning trees, and fundamental circuits 
will continue to appear from time to time in most of the succeeding chapters. 
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PROBLEMS 

3-1. Draw all trees of n labeled vertices for n = 1, 2, 3, 4, and 5. 

3-2. Draw all trees of n unlabeled vertices for n = 1,2, 3, 4, and 5. 

3-3. Draw all unlabeled rooted trees of n vertices for n = 1 , 2 , 3, 4, and 5. 

3-4. It can be shown that there are only six different (nonisomorphic) trees of six vertices. 

Two such trees are given in Fig. 2-4. Draw the other four. 

3-5. Prove Theorem 3-4. 

3-6. Show a tree in which its diameter is not equal to twice the radius. Under what 
condition does this inequality hold? Elaborate. 

3-7. Cite three different situations (games, activities, or problems) that can be represented 
by trees. Explain. 

3-8. How many isomers does pentane C 5 H 12 have? Hexane, C 6 Hm? 

3-9. Suppose you are given eight coins and are told that seven of them are of equal 
weight, and one coin is either heavier or lighter than the rest. You are provided 
with an equal-arm balance, which you may use only three times, for comparing 
coins. Sketch a strategy in the form of a decision tree for identifying the noncon¬ 
forming coin, as well as for finding out whether it is heavier or lighter than the rest. 

3-10. Sketch all (unlabeled) binary trees with six pendant vertices. Find the path length 
of each. [Hint: Distribute the 11 vertices (because n = 6 -I- 5) among different 
levels. Observe that level 0 has exactly one vertex, level 1 has exactly two vertices; 
level 2 can have either two or four vertices; and so on. There are six such trees, and 
two of them are shown in Fig. 3-13.] 

3-11. Sketch all spanning trees of the graph in Fig. 2-1. 

3-12. Show that a path is its own spanning tree. 
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3-13. Prove that a pendant edge (an edge whose one end vertex is of degree one) in a 
connected graph G is contained in every spanning tree of G. 

3-14. Prove that any subgraph g of a connected graph G is contained in some span¬ 
ning tree of G if and only if g contains no circuit. 

3-15. What is the nullity of a complete graph of n vertices? 

3-16. Show that a Hamiltonian path is a spanning tree. 

3-17. Prove that any circuit in a graph G must have at least one edge in common with 
a chord set. 

3-18. Prove Theorem 3-13. 

3-19. Find a spanning tree at a distance of four from spanning tree {b \, bi, bi,bi,b$, be} 
in Fig. 3-17. List all fundamental circuits with respect to this new spanning tree. 

3-20. Show that the distance between two spanning trees as defined in this chapter is a 
metric. 

3-21. Can you construct a graph if you are given all its spanning trees? How? 

3-22. Prove that the nullity of a graph does not change when you either insert a vertex 
in the middle of an edge, or remove a vertex of degree two by merging two edges 
incident on it. 

3-23. Prove that .any given edge of a connected graph G is a branch of some spanning 
tree of G. Is it also true that any arbitrary edge of G is a chord for some spanning 
tree of G ? 

3-24. Suggest a method for determining the total number of spanning trees of a connected 
graph without actually listing them. 

3-25. Prove that two colors are necessary and sufficient to paint all n vertices {n > 2) of 
a tree, such that no edge in the tree has both of its end vertices of the same color. 
(This fact is expressed by the statement that the chromatic number of a tree is two.) 

3-26. Suppose that you are given a set of n positive integers. State some necessary condi¬ 
tions of this set so that the set can be the degrees of all the n vertices of a tree. 
Are these conditions sufficient also? 

3-27. Let v be a vertex in a connected graph G. Prove that there exists a spanning tree T 
in G such that the distance of every vertex from v is the same both in G and in T. 

3-28. Let T\ and Tz be two spanning trees of a connected graph G. If edge e is in T\ 
but not in Tz, prove that there exists another edge /in Ti but not in T 1 such that 
subgraphs (T 1 — e) U / and (Tz — /) U e are also spanning trees of G. 

3-29. Construct a tree graph (with 16 vertices, each corresponding to a tree in Fig. 3-15) 
of a labeled complete graph of four vertices. 

3-30. In the tree graph obtained in Problem 3-29, observe the following properties (dis¬ 
covered by R. L. Cummins). A tree graph has at least one Hamiltonian circuit, 
and an arbitrary edge of a tree graph can be included in a Hamiltonian circuit. 

3-31. In a given connected weighted graph C, suppose there exists an edge e s whose 
weight is smaller than that of any other in G. Prove that every shortest spanning 
tree in G must contain e s . 

3-32. Let G be a connected weighted graph in which every edge belongs to some circuit. 
If ei is the edge with weight greater than that of any other edge in G, show that no 
shortest spanning tree in G will contain <?/. 

3-33. Show by constructing counterexamples that in Problems 3-31 and 3-32 the same 
cannot be said of the second smallest and the second largest edges, respectively. 
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3-34. Use the algorithm of Kruskal, as outlined in this chapter, to find a shortest spanning 
tree in the graph of Fig. 3-20(a). 

3-35. Pick 15 large cities in the United States and obtain the 105 intercity distances from 
an atlas. Find the shortest spanning tree connecting these cities by using (a) 
KruskaPs method, and (b) Prim’s method. Compare their relative efficiencies. 




CUT-SETS AND CUT-VERTICES 


In Chapter 3 we studied the spanning tree—a special type of subgraph of 
a connected graph G —which kept all the vertices of G together. In this chap¬ 
ter we shall study the cut-set —another type of subgraph of a connected graph 
G whose removal from G separates some vertices from others in G. Properties 
of cut-sets and their applications will be covered. Other related topics, such 
as connectivity, separability, and vulnerability of graphs, will also be dis¬ 
cussed. 

4-1. CUT-SETS 

In a connected graph G, a cut-set is a set of edgest whose removal from G 
leaves G disconnected, provided removal of no proper subset of these edges 
disconnects G. For instance, in Fig. 4-1 the set of edges {a, c, d,f ) is a cut-set. 
There are many other cut-sets, such as { a , b, g], {a, b, e,f}, and { d, h,f}. Edge 
{ k } alone is also a cut-set. The set of edges {a, c, h, d}, on the other hand, is not 
a cut-set, because one of its proper subsets, [a, c, h], is a cut-set. 

To emphasize the fact that no proper subset of a cut-set can be a cut-set, 
some authors refer to a cut-set as a minimal cut-set, a proper cut-set, or a 
simple cut-set. Sometimes a cut-set is also called a cocycle. We shall just use 
the term cut-set. 

A cut-set always “cuts” a graph into two. Therefore, a cut-set can also be 
defined as a minimal set of edges in a connected graph whose removal reduces 
the rank of the graph by one. The rank of the graph in Fig. 4.1(b), for in- 

fSince a set of edges (together with their end vertices) constitutes a subgraph, a cut¬ 
set in G is a subgraph of G. 
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(a) 


(b) 


Fig. 4-1 Removal of a cut-set [a, c, d,f} from a graph “cuts” it 
into two. 


stance, is four, one less than that of the graph in Fig. 4.1(a). Another way of 
looking at a cut-set is this: if we partition all the vertices of a connected graph 
G into two mutually exclusive subsets, a cut-set is a minimal number of edges 
whose removal from G destroys all paths between these two sets of vertices. 
For example, in Fig. 4-1 (a) cut-set [a, c, d,f } connects vertex set [v,, v 2 , v 6 ) 
with {v 3 , v 4 , v s }. (Note that one or both of these two subsets of vertices may 
consist of just one vertex.) Since removal of any edge from a tree breaks the 
tree into two parts, every edge of a tree is a cut-set. 

Cut-sets are of great importance in studying properties of communication 
and transportation networks. Suppose, for example, that the six vertices in 
Fig. 4-1 (a) represent six cities connected by telephone lines (edges). We wish 
to find out if there are any weak spots in the network that need strengthening 
by means of additional telephone lines. We look at all cut-sets of the graph, 
and the one with the smallest number of edges is the most vulnerable. In Fig. 
4-1(a), the city represented by vertex v 3 can be severed from the rest of the 
network by the destruction of just one edge. After some additional study of 
the properties of cut-sets, we shall return to their applications. 


4-2. SOME PROPERTIES OF A CUT SET 

Consider a spanning tree Tina connected graph G and an arbitrary cut¬ 
set S' in G. Is it possible for S not to have any edge in common with 7”? The 
answer is no. Otherwise, removal of the cut-set S from G would not discon¬ 
nect the graph. Therefore, 
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Theorem 4-1 

Every cut-set in a connected graph G must contain at least one branch of every 
spanning tree of G. 

Will the converse also be true? In other words, will any minimal set of 
edges containing at least one branch of every spanning tree be a cut-set? The 
answer is yes, by the following reasoning: 

In a given connected graph G, let Q be a minimal set of edges containing 
at least one branch of every spanning tree of G. Consider G — Q, the sub¬ 
graph that remains after removing the edges in Q from G. Since the subgraph 
G — Q contains no spanning tree of G, G — Q is disconnected (one compo¬ 
nent of which may just consist of an isolated vertex). Also, since Q is a mini¬ 
mal set of edges with this property, any edge e from Q returned to G — Q 
will create at least one spanning tree. Thus the subgraph G — Q + e will be 
a connected graph. Therefore, Q is a minimal set of edges whose removal 
from G disconnects G. This, by definition, is a cut-set. Hence 

Theorem 4-2 

In a connected graph G , any minimal set of edges containing at least one branch 
of every spanning tree of G is a cut-set. 

Theorem 4-3 

Every circuit has an even number of edges in common with any cut-set. 

Proof: Consider a cut-set S in graph G (Fig. 4-2). Let the removal of S partition 
the vertices of G into two (mutually exclusive or disjoint) subsets V x and V 2 . Con¬ 
sider a circuit T in G. If all the vertices in T are entirely within vertex set V x (or V 2 ), 
the number of edges common to S and T is zero; that is, N(S n T) = 0, an even 
number.t 

If, on the other hand, some vertices in T are in and some in V 2 , we traverse 



Circuit T shown in heavy lines, and is 
traversed along the direction of the arrows 


Fig. 4-2 Circuit and a cut-set in G. 
tAs in Chapter 3, N{g) stands for the number of edges in subgraph g. 
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back and forth between the sets V x and V 2 as we traverse the circuit (see Fig. 4-2). 
Because of the closed nature of a circuit, the number of edges we traverse between 
V x and V 2 must be even. And since very edge in S has one end in V x and the other 
in V 2 , and no other edge in G has this property (of separating sets V x and V 2 ), 
the number of edges common to S and T is even. ■ 


4-3. ALL CUT-SETS IN A GRAPH 

In Section 4-1 it was shown how cut-sets are used to identify weak spots 
in a communication net. For this purpose we list all cut-sets of the correspond¬ 
ing graph, and find which ones have the smallest number of edges. It must 
also have become apparent to you that even in a simple example, such as in 
Fig. 4-1, there is a large number of cut-sets, and we must have a systematic 
method of generating all relevant cut-sets. 

In the case of circuits, we solved a similar problem by the simple technique 
of finding a set of fundamental circuits and then realizing that other circuits in 
a graph are just combinations of two or more fundamental circuits. We shall 
follow a similar strategy here. Just as a spanning tree is essential for defining 
a set of fundamental circuits, so is a spanning tree essential for a set of fun¬ 
damental cut-sets. It will be beneficial for the reader to look for the parallelism 
between circuits and cut-sets. 

Fundamental Cut-Sets: Consider a spanning tree T of a connected graph 
G. Take any branch b in T. Since { b } is a cut-set in T, { b } partitions all vertices 
of T into two disjoint sets—one at each end of b. Consider the same partition 
of vertices in G, and the cut set S in G that corresponds to this partition. Cut¬ 
set S' will contain only one branch b of T, and the rest (if any) of the edges in 
S are chords with respect to T. Such a cut-set S containing exactly one branch 
of a tree T is called a fundamental cut-set with respect to T. Sometimes a 
fundamental cut-set is also called a basic cut-set. In Fig. 4-3, a spanning tree 


\ 

\ 



Fig. 4-3 Fundamental cut-sets of a graph. 
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T (in heavy lines) and all five of the fundamental cut-sets with respect to T 
are shown (broken lines “cutting” through each cut-set). 

Just as every chord of a spanning tree defines a unique fundamental cir¬ 
cuit, every branch of a spanning tree defines a unique fundamental cut-set. It 
must also be kept in mind that the term fundamental cut-set (like the term 
fundamental circuit) has meaning only with respect to a given spanning tree. 

Now we shall show how other cut-sets of a graph can be obtained from a 
given set of cut-sets. 

Theorem 4-4 

The ring sum of any two cut-sets in a graph is either a third cut-set or an edge- 
disjoint union of cut-sets. 

Outline of Proof: Let S { and S 2 be two cut-sets in a given connected graph 
G. Let V x and V 2 be the (unique and disjoint) partitioning of the vertex set 
V of G corresponding to S x . Let V 3 and V 4 be the partitioning corresponding 
to S 2 . Clearly [see Figs. 4-4(a) and (b)], 

v, U V 2 = V and V x n V 2 = 0, 

v 3 U V 4 = V and V 3 D V 4 = 0 . 

Now let the subset (K, Pi V 4 ) U (V 2 n V 3 ) be called V 5 , and this by 

definition is the same as the ring sum V t @ V 3 . Similarly, let the subset 

(Vj n v 3 ) u (V 2 n V 4 ) be called V 6 , which is the same as V 2 @ V 3 . See Fig. 
4-4(c). 

The ring sum of the two cut-sets Sj @ S 2 can be seen to consist only of 
edges that join vertices in V $ to those in V 6 . Also, there are no edges outside 
5, © S 2 that join vertices in V 5 to those in V 6 . 

Thus the set of edges S x © S 2 produces a partitioning of V into V 5 and 
V 6 such that 

v 5 u V 6 = V and V 5 n V 6 = 0 . 

Hence S t © S 2 is a cut-set if the subgraphs containing V 5 and V 6 each remain 
connected after 5, © S 2 is removed from G. Otherwise, S x © S 2 is an edge- 
disjoint union of cut-sets. 

Example: In Fig. 4-3 let us consider ring sums of the following three pairs 
of cut-sets. 


{d, e,f) © {f g, h} = {d, e, g, h], another cut-set, 

[a, b } © [ b , c, e,f) = {a, c, e,f], another cut-set, 

{d, e, g, h) © {f g, k } = {d, e,f h, k } 

= {d, e,f} U [A, k}, an edge-disjoint 
union of cut-sets. ■ 
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(a) 


v 2 n v 3 




Fig. 4-4 Two cut-sets and their partitionings. 

So we have a method of generating additional cut-sets from a number of 
given cut-sets. Obviously, we cannot start with any two cut-sets in a given 
graph and hope to obtain all its cut-sets by this method. What then is a 
minimal set of cut-sets from which we can obtain every cut-set of G by taking 
ring sums? The answer (to be proved in Chapter 6) is the set of all fundamen¬ 
tal cut-sets with respect to a given spanning tree. 

4-4. FUNDAMENTAL CIRCUITS AND CUT-SETS 

Consider a spanning tree Tina given connected graph G. Let <?,. be a chord 
with respect to T, and let the fundamental circuit made by c,- be called T, con¬ 
sisting of k branches b x , b 2 , . . . , b k in addition to the chord c t ; that is, 

r = (c., b x , b 2 , . . . , b k } is a fundamental circuit with respect to T. 

Every branch of any spanning tree has a fundamental cut-set associated with 
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it. Let S i be the fundamental cut-set associated with 6,, consisting of q chords 
in addition to the branch b x ; that is, 

S x = [b x , c x , c 2 ,. . . , c g ] is a fundamental cut-set with respect to T. 

Because of Theorem 4-3, there must be an even number of edges common 
to r and S x . Edge b x is in both T and S x , and there is only one other edge in 
r (which is c,.) that can possibly also be in S x . Therefore, we must have two 
edges b x and c t common to S', and T. Thus the chord c. is one of the chords 
C’ ^2’ 4 ' • * C q - 

Exactly the same argument holds for fundamental cut-sets associated with 
b 2 , b 3 , ... , and b k . Therefore, the chord c t is contained in every fundamental 
cut-set associated with branches in I\ 

Is it possible for the chord c. to be in any other fundamental cut-set S' 
(with respect to T, of course) besides those associated with b x , b 2 ,. . . and b k ? 
The answer is no. Otherwise (since none of the branches in T are in S'), there 
would be only one edge c t common to S' and T, a contradiction to Theorem 
4-3. Thus we have an important result. 

Theorem 4-5 

With respect to a given spanning tree T , a chord c t that determines a fundamental 
circuit T occurs in every fundamental cut-set associated with the branches in T 
and in no other. 

As an example, consider the spanning tree [b, c , e, h , k], shown in heavy 
lines, in Fig. 4-3. The fundamental circuit made by chord /is 

{/ e, h, k}. 

The three fundamental cut-sets determined by the three branches e, h, and k 
are 

determined by branch e: { d,e,f ), 
determined by branch h : {/ g, h], 

determined by branch k: {/ g, k}. 

Chord / occurs in each of these three fundamental cut-sets, and there is no 
other fundamental cut-set that contains /. The converse of Theorem 4-5 is 
also true. 

Theorem 4-6 

With respect to a given spanning tree T, a branch b t that determines a funda¬ 
mental cut-set S is contained in every fundamental circuit associated with the 
chords in S, and in no others. 
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Proof: The proof consists of arguments similar to those that led to Theorem 
4-5. Let the fundamental cut-set S determined by a branch b t be 

S \Pi , Cl, C 2 , • • • , Cp\, 

and let T] be the fundamental circuit determined by chord c x : 

r 1 = {ci, b u b 2 ,. . . , b q }. 

Since the number of edges common to S and T { must be even, b ( must be in r,. 
The same is true for the fundamental circuits made by chords c 2 , c 3 , . . . , c p . 

On the other hand, suppose that 6, occurs in a fundamental circuit r p+1 made 
by a chord other than c u c 2 , . . . , c p . Since none of the chords c,, c 2 , . . . , c p is 
in r p+1 , there is only one edge b t common to a circuit 1%+! and the cut-set S, 
which is not possible. Hence the theorem. ■ 

Turning again for illustration to the graph in Fig. 4-3, consider branch 
e of spanning tree {b, c , e, h , k}. The fundamental cut-set determined by e is 

{e, dj }. 

The two fundamental circuits determined by chords d and / are 

determined by chord d: [d, c, e }, 
determined by chord /: {/, e, h, k}. 

Branch e is contained in both these fundamental circuits, and none of the 
remaining three fundamental circuits contains branch e. 

4-5. CONNECTIVITY AND SEPARABILITY 

Edge Connectivity: Each cut-set of a connected graph G consists of a cer¬ 
tain number of edges. The number of edges in the smallest cut-set (i.e., cut¬ 
set with fewest number of edges) is defined as the edge connectivity of G. 
Equivalently, the edge connectivity of a connected grapht can be defined as 
the minimum number of edges whose removal (i.e., deletion) reduces the 
rank of the graph by one. The edge connectivity of a tree, for instance, is one. 
The edge connectivities of the graphs in Figs. 4-1(a), 4-3, 4-5 are one, two, 
and three, respectively. 

Vertex Connectivity: On examining the graph in Fig. 4-5, we find that 
although removal of no single edge (or even a pair of edges) disconnects the 

fAlthough we shall talk of edge connectivity and vertex connectivity only for a con¬ 
nected graph, some authors define both the edge connectivity and vertex connectivity of 
a disconnected graph as zero. 
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graph, the removal of the single vertex v does.t Therefore, we define another 
analogous term called vertex connectivity. The vertex connectivity (or simply 
connectivity ) of a connected graph G is defined as the minimum number of 
vertices whose removal from G leaves the remaining graph disconnected.t 
Again, the vertex connectivity of a tree is one. The vertex connectivities of the 
graphs in Figs. 4-l(a), 4-3, and 4-5 are one, two, and one, respectively. Note 
that from the way we have defined it vertex connectivity is meaningful only 
for graphs that have three or more vertices and are not complete. 

Separable Graph: A connected graph is said to be separable if its vertex 
connectivity is one. All other connected graphs are called nonseparable. An 
equivalent definition is that a connected graph G is said to be separable if 
there exists a subgraph g in G such that g (the complement of g in G) and g 
have only one vertex in common. That these two definitions are equivalent 
can be easily seen (Problem 4-7). In a separable graph a vertex whose removal 
disconnects the graph is called a cut-vertex, a cut-node, or an articulation point. 
For example, in Fig. 4-5 the vertex v is a cut-vertex, and in Fig. 4-1(a) vertex 
v 4 is a cut-vertex. It can be shown (Problem 4-18) that in a tree every vertex 
with degree greater than one is a cut-vertex. Moreover: 


Theorem 4-7 

A vertex v in a connected graph G is a cut-vertex if and only if there exist two 
vertices x and y in G such that every path between x and y passes through v. 

The proof of the theorem is quite easy and is left as an exercise (Problem 
4-17). The implication of the theorem is very significant. It states that v is a 
crucial vertex in the sense that any communication between x and y (if G re¬ 
presented a communication network) must “pass through” v. 

f Recall that removal of a vertex implies the removal of all the edges incident on that 
vertex, because without both the end vertices an edge does not exist. On the other hand, 
when we delete or remove an edge from a graph, the end vertices of the edge are still left 
in the graph. 

fSee the footnote on p. 75. 
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Fig. 4-6 Graph with 8 vertices and 16 
edges. 

An Application: Suppose we are given n stations that are to be connected 
by means of e lines (telephone lines, bridges, railroads, tunnels, or highways) 
where e > n — 1. What is the best way of connecting? By “best” we mean 
that the network should be as invulnerable to destruction of individual sta¬ 
tions and individual lines as possible. In other words, construct a graph with 
n vertices and e edges that has the maximum possible edge connectivity and 
vertex connectivity. 

For example, the graph in Fig. 4-5 has n = 8, e = 16, and has vertex 
connectivity of one and edge connectivity of three. Another graph with the 
same number of vertices and edges (8 and 16, respectively) can be drawn as 
shown in Fig. 4-6. 

It can easily be seen that the edge connectivity as well as the vertex con¬ 
nectivity of this graph is four. Consequently, even after any three stations are 
bombed, or any three lines destroyed, the remaining stations can still con¬ 
tinue to “communicate” with each other. Thus the network of Fig. 4-6 is 
better connected than that of Fig. 4-5 (although both consist of the same 
number of lines—16). 

The next question is what is the highest vertex and edge connectivity we 
can achieve for a given n and el The following theorems constitute the 
answer. 

Theorem 4-8 

The edge connectivity of a graph G cannot exceed the degree of the vertex with 
the smallest degree in C. 

Proof: Let vertex v t be the vertex with the smallest degree in G. Let d(vA be 
the degree of v h Vertex v t can be separated from G by removing the d{ t\) edges 
incident on vertex v h Hence the theorem. ■ 

Theorem 4-9 

The vertex connectivity of any graph G can never exceed the edge connectivity 
of G. 

Proof: Let a denote the edge connectivity of G. Therefore, there exists a cut¬ 
set S in G with a edges. Let S partition the vertices of G into subsets V { and V 2 . 
By removing at most a vertices from V t (or V 2 ) on which the edges in S are incident, 
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we can effect the removal of S (together with all other edges incident on these 
vertices) from <7. Hence the theorem. ■ 

Corollary 

Every cut-set in a nonseparable graph with more than two vertices contains at 
least two edges. 


Theorem 4-10 

The maximum vertex connectivity one can achieve with a graph G of n vertices 
and e edges (e > n — 1) is the integral part of the number 2e/n; that is, |_2e/«J. 

Proof: Every edge in G contributes two degrees. The total ( 2e degrees) is divided 
among n vertices. Therefore, there must be at least one vertex in G whose degree 
is equal to or less than the number 2e/n. The vertex connectivity of G cannot exceed 
this number, in light of Theorems 4-8 and 4-9. 

To show that this value can actually be achieved, one can first construct an 
n-vertex regular graph of degree [_2ejn\ and then add the remaining e — («/2)*|_2e/«j 
edges arbitrarily. The completion of the proof is left as an exercise. ■ 

The results of Theorems 4-8, 4-9, and 4-10 can be summarized as follows: 


and 


2e 

vertex connectivity < edge connectivity < —» 


maximum vertex connectivity possible 


2e 

n 


Thus, for a graph with 8 vertices and 16 edges (Figs. 4-5 and 4-6), for example, 
we can achieve a vertex connectivity (and therefore edge connectivity) as 
high as four (= 2* 16/8). 

A graph G is said to be k-connected if the vertex connectivity of G is k; 
therefore, a 1-connected graph is the same as a separable graph. 

Theorem 4-11 

A connected graph G is ^-connected if and only if every pair of vertices in G is 
joined by k or more paths that do not intersect,t and at least one pair of vertices 
is joined by exactly k nonintersecting paths. 

Theorem 4-12 

The edge connectivity of a graph G is k if and only if every pair of vertices in 
G is joined by k or more edge-disjoint paths (i.e., paths that may intersect, but have 
no edges in common), and at least one pair of vertices is joined by exactly k edge- 
disjoint paths. 

f Paths with no common vertices, except the two terminal vertices, are called noninter¬ 
secting paths or vertex-disjoint paths. 
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The reader is referred to Chapter 5 of [1-5] for the proofs of Theorems 
4-11 and 4-12. Note that our definition of ^-connectedness is slightly differ¬ 
ent from the one given in [1-5]. A special result of Theorem 4-11 is that a 
graph G is nonseparable if and only if any pair of vertices in G can be placed 
in a circuit (Problem 4-13). 

The reader is encouraged to verify these theorems by enumerating all 
edge-disjoint and vertex-disjoint paths between each of the 15 pairs of ver¬ 
tices in Fig. 4-3. 


4-6. NETWORK FLOWS 

In a network of telephone lines, highways, railroads, pipelines of oil (or 
gas or water), and so on, it is important to know the maximum rate of flow 
that is possible from one station to another in the network. This type of net¬ 
work is represented by a weighted connected graph in which the vertices 
are the stations and the edges are lines through which the given commodity 
(oil, gas, water, number of messages, number of cars, etc.) flows. The weight, 
a real positive number, associated with each edge represents the capacity of 
the line, that is, the maximum amount of flow possible per unit of time. The 
graph in Fig. 4-7, for example, represents a flow network consisting of 12 
stations and 31 lines. The capacity of each of these lines is also indicated in 
the figure. 

It is assumed that at each intermediate vertex the total rate of commodity 
entering is equal to the rate leaving. In other words, there is no accumulation 
or generation of the commodity at any vertex along the way. Furthermore, 
the flow through a vertex is limited only by the capacities of the edges inci¬ 
dent on it. In other words, the vertex itself can handle as much flow as allowed 
through the edges. Finally, the lines are lossless. 



Fig. 4-7 Graph of a flow network. 
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In such a flow problem the questions to be answered are 

1. What is the maximum flow possible through the network between a 
specified pair of vertices—say, from B to M in Fig. 4-7? 

2. How do we achieve this flow (i.e., determine the actual flow through 
each edge when the maximum flow exists)? 

Theorem 4-13, perhaps the most important result in the theory of trans¬ 
port networks, answers the first question. The second question is answered 
implicitly by a constructive proof of the theorem. To facilitate the statement 
and proof of the theorem, let us define a few terms. 

A cut-set with respect to a pair of vertices a and b in a connected graph G 
puts a and b into two different components (i.e., separates vertices a and b). 
For instance, in Fig. 4-3 cut-set { d , e,f} is a cut-set with respect to v x and v 6 . 
The set [f g, h } is also a cut-set with respect to v x and v 6 . But the cut-set 
{/ g > h] is not a cut-set with respect to v x and v 6 . The capacity of cut-set S in 
a weighted connected graph G (in which the weight of each edge represents 
its flow capacity) is defined as the sum of the weights of all the edges in S. 

Theorem 4-13 

The maximum flow possible between two vertices a and b in a network is equal 
to the minimum of the capacities of all cut-sets with respect to a and b. 

Proof: Consider any cut-set 5" with respect to vertices a and b in G. In the sub¬ 
graph G — S (the subgraph left after removing S from G) there is no path between 
a and b. Therefore, every path in G between a and b must contain at least one edge 
of S. Thus every flow from a to b (or from b to a) must pass through one or more 
edges of S. Hence the total flow rate between these two vertices cannot exceed the 
capacity of S. Since this holds for all cut-sets with respect to a and b, the flow rate 
cannot exceed the minimum of their capacities. ■ 

To show that this flow can actually be achieved is somewhat involved. It 
requires some concepts that are to be introduced later. The complete proof 
will therefore be deferred till Chapter 14, where flow problems will be treated 
in much greater detail. 

4-7. 1-ISOMORPHISM 

A separable graph consists of two or more nonseparable subgraphs. Each 
of the largest nonseparable subgraphs is called a block. (Some authors use the 
term component , but to avoid confusion with components of a disconnected 
graph, we shall use the term block.) The graph in Fig. 4-5 has two blocks. The 
graph in Fig. 4-8 has five blocks (and three cut-vertices a, b, and c); each block 
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Fig. 4-8 Separable graph with three cut-vertices and five blocks. 



Fig. 4-9 Disconnected graph 1-isomorphic to Fig. 4-8. 


is shown enclosed by a broken line. Note that a nonseparable connected 
graph consists of just one block. 

Visually compare the disconnected graph in Fig. 4-9 with the one in Fig. 
4-8. These two graphs are certainly not isomorphic (they do not have the same 
number of vertices), but they are related by the fact that the blocks of the 
graph in Fig. 4-8 are isomorphic to the components of the graph in Fig. 4-9. 
Such graphs are said to be 1-isomorphic. More formally: 

Two graphs G { and G 2 are said to be 1-isomorphic if they become isomor¬ 
phic to each other under repeated application of the following operation. 

Operation 1: “Split” a cut-vertex into two vertices to produce two disjoint 
subgraphs. 

From this definition it is apparent that two nonseparable graphs are 1- 
isomorphic if and only if they are isomorphic. 

Theorem 4-14 

If G i and G 2 are two 1-isomorphic graphs, the rank of G i equals the rank of 
G 2 and the nullity of Gi equals the nullity of G 2 . 

Proof: Under operation 1, whenever a cut-vertex in a graph G is “split” into 
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two vertices, the number of components in G increases by one. Therefore, the rank 
of G which is 


number of vertices in G — number of components in G 
remains invariant under operation 1. 

Also, since no edges are destroyed or new edges created by operation 1, two 
1-isomorphic graphs have the same number of edges. Two graphs with equal rank 
and with equal numbers of edges must have the same nullity, because 

nullity — number of edges — rank. ■ 

What if we join two components of Fig. 4-9 by “gluing” together two 
vertices (say vertex x to y)l We obtain the graph shown in Fig. 4-10. 

Clearly, the graph in Fig. 4-10 is 1-isomorphic to the graph in Fig. 4-9. 
Since the blocks of the graph in Fig. 4-10 are isomorphic to the blocks of the 
graph in Fig. 4-8, these two graphs are also 1-isomorphic. Thus the three 
graphs in Figs. 4-8, 4-9, and 4-10 are 1-isomorphic to one another. 



Fig. 4-10 Graph 1-isomorphic to Figs. 4-8 and 4-9. 


4-8. 2-ISOMORPHISM 

In Section 4-7 we generalized the concept of isomorphism by introducing 
1-isomorphism. A graph <7, was 1-isomorphic to graph G 2 if the blocks of (7, 
were isomorphic to the blocks of <7 2 . Since a nonseparable graph is just one 
block, 1-isomorphism for nonseparable graphs is the same as isomorphism. 
However, for separable graphs (i.e., graphs with vertex connectivity of one), 
1-isomorphism is different from isomorphism. Graphs that are isomorphic 
are also 1-isomorphic, but 1-isomorphic graphs may not be isomorphic. This 
generalized isomorphism is very useful in the study of separable graphs. 
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We can generalize this concept further to broaden its scope for 2-connect- 
ed graphs (i.e., graphs with vertex connectivity of two), as follows: 

In a 2-connected graph G let vertices x and y be a pair of vertices whose 
removal from G will leave the remaining graph disconnected. In other words, 
G consists of a subgraph g, and its complement g, such that g, and g, have 
exactly two vertices, x and y, in common. Suppose that we perform the fol¬ 
lowing operation 2 on G (after which, of course, G no longer remains the 
original graph). 

Operation 2: “Split” the vertex x into x x and x 2 and the vertex y into y x 
and >> 2 such that <7 is split intog, andg]. Let vertices x x andj>, gowithg, and 
x 2 and>> 2 withg,. Now rejoin the graphs g x andg! by merging x x with y 2 and 
x 2 with y x . (Clearly, edges whose end vertices were x and y in G could have 
gone with g, or g,, without affecting the final graph.) 

Two graphs are said to be 2-isomorphic if they become isomorphic after 
undergoing operation 1 (in Section 4-7) or operation 2, or both operations 
any number of times. For example, Fig. 4-11 shows how the two graphs in 
Figs. 4-11(a) and (d) are 2-isomorphic. Note that in (a) the degree of vertex 
* is four, but in (d) no vertex is of degree four. 

From the definition it follows immediately that isomorphic graphs are 
always 1-isomorphic, and 1-isomorphic graphs are always 2-isomorphic. 
But 2-isomorphic graphs are not necessarily 1-isomorphic, and 1-isomorphic 



Fig. 4-11 2-isomorphic graphs (a) and (d). 
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graphs are not necessarily isomorphic. However, for graphs with connectivity 
three or more, isomorphism, 1-isomorphism, and 2-isomorphism are syn¬ 
onymous. 

It is clear that no edges or vertices are created or destroyed under opera¬ 
tion 2. Therefore, the rank and nullity of a graph remain unchanged under 
operation 2. And as shown in Section 4-7, the rank or nullity of a graph does 
not change under operation 1. Therefore, 2-isomorphic graphs are equal in 
rank and equal in nullity. The fact that the rank r and nullity p are not enough 
to specify a graph within 2-isomorphism can easily be shown by constructing 
a counterexample (Problem 4-23). 

Circuit Correspondence: Two graphs G x and G 2 are said to have a circuit 
correspondence if they meet the following condition: There is a one-to-one 
correspondence between the edges of G, and G 2 and a one-to-one correspond¬ 
ence between the circuits of G, and G 2 , such that a circuit in G 1 formed by 
certain edges of G, has a corresponding circuit in G 2 formed by the corre¬ 
sponding edges of G 2 , and vice versa. Isomorphic graphs, obviously, have 
circuit correspondence. 

Since in a separable graph G every circuit is confined to a particular 
block (Problem 4-15), every circuit in G retains its-edges as G undergoes 
operation 1 (in Section 4-7). Hence 1-isomorphic graphs have circuit corre- 
pondence. 

Similarly, let us consider what happens to a circuit in a graph G when it 
undergoes operation 2, as defined in this section. A circuit T in G will fall in 
one of three categories: 

1. r is made of edges all in g,, or 

2. r is made of edges all in g l5 or 

3. r is made of edges from both g, and g,, and in that case T must include 
both vertices x and y. 

In cases 1 and 2, T is unaffected by operation 2. In case 3, T still has the 
original edges, except that the path between vertices x and y in g,, which 
constituted a part of T, is “flipped around.” Thus every circuit in a graph 
undergoing operation 2 retains its original edges. Therefore, 2-isomorphic 
graphs also have circuit correspondence. 

Theorem 4-15, which is considered the most important result for 2-isomor¬ 
phic graphs, is due to H. Whitney. 

Theorem 4-15 

Two graphs are 2-isomorphic if and only if they have circuit correspondence. 

Proof: The “only if” part has already been shown in the argument preceding 
the theorem. The “if” part is more involved, and the reader is referred to Whitney’s 
original paper [4-7]. 
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As we shall observe in subsequent chapters, the ideas of 2-isomorphism 
and circuit correspondence play important roles in the theory of contact 
networks, electrical networks, and in duality of graphs. 

SUMMARY 

Our main concern in this chapter was with answering the following ques¬ 
tion about a connected graph: Which part of a connected graph, when re¬ 
moved, breaks the graph apart? Clearly, the answer to this question does 
specify a graph in many aspects and tells a great deal about it. Some of these 
properties are of considerable significance both in theory and applications of 
graphs. 

In pursuit of the answer to the above question, we came across the con¬ 
cepts of cut-sets, cut-vertices, connectivity, and so on. Many of the theorems 
showed relationships between these characteristics of a graph. 

In contrast to a spanning tree (which keeps the vertices together), a cut¬ 
set separates the vertices. Consequently, there was bound to be a close re¬ 
lationship between a spanning tree and a cut-set. Some of the theorems (and 
the problems at the end of this chapter) describe this relationship between 
spanning trees and cut-sets. 

In terms of the minimum number of vertices whose removal disconnects 
a graph, all graphs can be classified according to Fig. 4-12. 



(m is the vertex-connectivity) 

Fig. 4-12 Classification of graphs according to their connectivity. 
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A very important and practical result of this chapter was the max-flow 
min-cut theorem (Theorem 4-13). 
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PROBLEMS 

4-1. Pick an arbitrary spanning tree in the graph given in Fig.4-6. List all seven (because 
n — 1 = 7) fundamental cut-sets with respect to this tree. 

4-2. By taking the ring sum of the seven fundamental cut-sets obtained in Problem 4-1, 
list all other cut-sets of the graph. 

4-3. List all cut-sets with respect to the vertex pair vz, V3 in the graph in Fig. 4-1 (a). 
4-4. Show that the edge connectivity and vertex connectivity of the graphs in Fig. 2-2 
are each equal to three. 

4-5. What is the edge connectivity of the complete graph of n vertices? 

4-6. Prove that in a connected graph G the complement of a cut-set in G does not con¬ 
tain a spanning tree and the complement of a spanning tree (i.e., chord set) does 
not contain a cut-set. 

4-7. Show that the two definitions of separability in Section 4-5 are equivalent. 

4-8. Prove that in a nonseparable graph G the set of edges incident on each vertex of G 

is a cut-set. 
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4-9. Why is the result of Problem 4-8 not applicable to separable graphs also? Explain. 
4-10. Prove that in a connected graph G a vertex v is a cut-vertex if and only if there 
exist two (or more) edges x and y incident on v such that no circuit in G includes 
both * and y. 

4-11. Prove that every connected graph with three or more vertices has at least two 
vertices which are not cut-vertices. 

4-12. Prove that a nonseparable graph has a nullity p = 1 if and only if the graph is a 
circuit. 

4-13. Show that a graph G is nonseparable if and only if every vertex pair in G can be 
placed in some circuit in G. 

4-14. Show that a simple graph is nonseparable if and only if for any two given arbitrary 
edges a circuit can always be found that will include these two edges. 

4-15. How can you utilize the result of Problem 4-13 to obtain an algorithm for iden¬ 
tifying every block of a large separable graph? 

4-16. What is a necessary and sufficient condition that any n — 1 cut-sets in Problem 4-8 
constitute a set of fundamental cut-sets in G? 

4-17. Prove Theorem 4-7. 

4-18. Prove that in a tree every vertex of degree greater than one is a cut-vertex. 

4-19. Show that a graph with n vertices and with vertex connectivity k must have at 
least kn) 2 edges. (A special case of this result is that the degree of every vertex in a 
nonseparable graph is at least two.) 

4-20. Is every regular graph of degree d{d> 3) nonseparable ? If not, give a simple regular 
graph of degree three that is separable. 

4-21. Complete the proof of Theorem 4-10. 

4-22. In a connected graph G, let Q be a set of edges with the following properties: 

(a) Q has an even number (zero included) of edges in common with every cut-set 
of G. 

(b) There is no proper subset of Q that satisfies property (a). 

Prove that Q is a circuit. 

4-23. Construct a graph G with the following properties: Edge connectivity of G = 4, 
vertex connectivity of G = 3, and degree of every vertex of G > 5. 

4-24. Show (by drawing them) that two graphs with the same rank and the same nullity 
need not be 2-isomorphic. 

4-25. In Fig. 4-7, between vertices A and M, pick out a complete set of 

(a) Edge-disjoint paths. 

(b) Vertex-disjoint paths. 

From this, verify Theorems 4-11 and 4-12. 

4-26. Suppose that a singles tennis tournament is to be arranged among n players and the 
number of matches planned is a fixed number e (where n — 1 < e < n{n — l)/2). 
For the sake of fairness, how will you make sure that some players do not group 
together and isolate an individual (or a small group of players)? 

4-27. Let us define a new term called edge isomorphism as follows: Two graphs G\ and 
G 2 are edge isomorphic if there is a one-to-one correspondence between the edges 
of G i and Gi such that two edges are incident (at a common vertex) in G\ if and 
only if the corresponding edges are also incident in Gi. Discuss the properties of 
edge isomorphism. Construct an example to prove that edge-isomorphic graphs 
may not be isomorphic. 

4-28. Prove that an Euler graph cannot have a cut-set with an odd number of edges. 
{Hint: Use Theorem 1-1.) 




PLANAR AND DUAL GRAPHS 


In Chapters 2, 3, and 4 we studied properties of subgraphs, such as paths, 
circuits, spanning trees, and cut-sets, in a given connected graph G. In this 
chapter we shall subject the entire graph G to the following important ques¬ 
tion: Is it possible to draw G in a plane without its edges crossing over? 

This question of planarity is of great significance from a theoretical point 
of view. In addition, planarity and other related concepts are useful in many 
practical situations. For instance, in the design of a printed-circuit board, the 
electrical engineer must know if he can make the required connections without 
an extra layer of insulation. The solution to the puzzle of three utilities, posed 
in Chapter 1, requires the knowledge of whether or not the corresponding 
graph can be drawn in a plane. 

But before we attempt to draw a graph in a plane, let us examine the 
meaning of “drawing” a graph. 


5-1. COMBINATORIAL VERSUS GEOMETRIC 
GRAPHS 

As mentioned in Chapter 1, a graph exists as an abstract object, devoid of 
any geometric connotation of its ability of being drawn in a three-dimensional 
Euclidean space. For example, an abstract graph G, can be defined as 

G, = (V, E, 'F), 

where the set V consists of the five objects named a, b, c, d, and e, that is, 

V = {a, b, c, d, e]. 
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and the set E consists of seven objects (none of which is in set V ) named 1,2, 
3, 4, 5, 6, and 7, that is, 


£={1,2, 3, 4, 5, 6, 7], 


and the relationship between the two sets is defined by the mapping 'F, which 
consists of 


- 


1 —> (a, c ) 

2 —> (c, d) 

3 —> ( a , d ) 

4 —> {a, b ). 

5 —> ( b , d) 

6 —> (d, e) 

7 —> (b, e ) 


Here, the symbol 1 ■ -> (a, c ) says that object 1 from set E is mapped onto the 
(unordered) pair ( a , c) of objects from set V. 

Now it so happens that this combinatorial abstract object G, can also be 
represented by means of a geometric figure. In fact, the sketch in Fig. 2-13 is 
one such geometric representation of this graph. Moreover, it is also true that 
any graph can be represented by means of such a configuration in three- 
dimensional Euclidean space. 

It is important to realize that what is sketched in Fig. 2-13 is merely one 
(out of infinitely many) representation of the graph and not the graph G, 
itself. We could have, for instance, twisted some of the edges or could have 
placed e within the triangle a , d , b and thereby obtained a different figure 
representing G,. However, when there is no chance of confusion, a pictorial 
representation of the graph has been and will be regarded as the graph 
itself. 

This convenient slurring over is done deliberately for the sake of simplicity 
and clarity. Learning graph theory for the first time without any diagrams 
would be extremely difficult and little fun.t 

Unlike in the last four chapters, in this chapter it will often be necessary to 
make a distinction between the abstract (or combinatorial) graph and a 
geometric representation of a graph. 

fAt this point I cannot resist quoting the following comment by Hadamard: “Des¬ 
cartes distrusts that intervention of imagination, and wishes to eliminate it completely 
from science.. . . More recently, another rigorous treatment of. . . geometry . . . freed 
from any appeal to intuition, has been developed ... by the celebrated mathematician 
Hilbert. Logically, every intervention of geometrical sense is eliminated. But is it the same 
from the psychological point of view? Certainly not... . Diagrams appear at practically 
every page (of Hilbert’s book).” 
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5-2. PLANAR GRAPHS 

A graph G is said to be planar if there exists some geometric representation 
of G which can be drawn on a plane such that no two of its edges intersect.t 
A graph that cannot be drawn on a plane without a crossover between its 
edges is called nonplanar. 

A drawing of a geometric representation of a graph on any surface such 
that no edges intersect is called embedding. Thus, to declare that a graph G is 
nonplanar, we have to show that of all possible geometric representations of 
G none can be embedded in a plane. Equivalently, a geometric graph G is 
planar if there exists a graph isomorphic to G that is embedded in a plane. 
Otherwise, G is nonplanar. An embedding of a planar graph G on a plane is 
called a plane representation of G. 

For instance, consider the graph represented by Fig. 1-3. The geometric 
representation shown in Fig. 1-3 clearly is not embedded in a plane, because 
the edges e and/are intersecting. But if we redraw edge/outside the quadri¬ 
lateral, leaving the other edges unchanged, we have embedded the new 
geometric graph in the plane, thus showing that the graph which is being 
represented by Fig. 1-3 is planar. As another example, the two isomorphic 
diagrams in Fig. 2-2 are different geometric representations of one and the 
same graph. One of the diagrams is a plane representation; the other one is 
not. The graph, of course, is planar. On the other hand, you will not be able 
to draw any of the three configurations in Fig. 2-3 on a plane without edges 
intersecting. The reason is that the graph which these three different diagrams 
in Fig. 2-3 represent is nonplanar. 

A natural question now is: How can we tell if a graph G [which may be 
given by an abstract notation G = ( V, E, *F) or by one of its geometric 
representations] is planar or nonplanar? To answer this question, let us first 
discuss two specific nonplanar graphs which are of fundamental importance. 
These are called Kuratowski’s graphs, after the Polish mathematician Kasimir 
Kuratowski, who discovered their unique property. 


5-3. KURATOWSKI'S TWO GRAPHS 

Theorem 5-1 

The complete graph of five vertices is nonplanar. 

Proof: Let the five vertices in the complete graph be named v u v 2 , v 3 , v 4 , and 
v 5 . A complete graph, as you may recall, is a simple graph in which every vertex 
is joined to every other vertex by means of an edge. This being the case, we must 


fNote that the “meeting” of edges at a vertex is not considered an intersection. 
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u 2 



V 5 V 4 

(e) 

Fig. 5-1 Building up of the five-vertex complete graph. 


have a circuit going from v x to v 2 to v 3 to v 4 to v 5 to v x —that is, a pentagon. 
See Fig. 5-1 (a). This pentagon must divide the plane of the paper into two regions, 
one inside and the other outside (Jordan curve theorem). 

Since vertex v x is to be connected to v 3 by means of an edge, this edge may be 
drawn inside or outside the pentagon (without intersecting the five edges drawn 
previously). Suppose that we choose to draw a line from v x to v 3 inside the pen¬ 
tagon. See Fig. 5-1 (b). (If we choose outside, we end up with the same argument.) 
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Now we have to draw an edge from v 2 to v 4 and another one from v 2 to v s . Since 
neither of these edges can be drawn inside the pentagon without crossing over the 
edge already drawn, we draw both these edges outside the pentagon. See Fig. 5-1 (c). 
The edge connecting v 3 and v 5 cannot be drawn outside the pentagon without 
crossing the edge between v 2 and v 4 . Therefore, v 3 and v 5 have to be connected 
with an edge inside the pentagon. See Fig. 5-1 (d). 

Now we have yet to draw an edge between v x and v 4 . This edge cannot be 
placed inside or outside the pentagon without a crossover. Thus the graph cannot 
be embedded in a plane. See Fig. 5-1 (e). ■ 

Some readers may find this proof somewhat unsatisfactory because it 
depends so heavily on visual intuition. Do not despair; we shall provide you 
with an algebraic nonvisual proof in the next section. 

A complete graph with five vertices is the first of the two graphs of 
Kuratowski. The second graph of Kuratowski is a regulart connected graph 
with six vertices and nine edges, shown in its two common geometric rep¬ 
resentations in Figs. 5-2(a) and (b), where it is fairly easy to see that the 
graphs are isomorphic. 

Employing visual geometric arguments similar to those used in proving 
Theorem 5-1, it can be shown that the second graph of Kuratowski is also 
nonplanar. The proof of Theorem 5-2 is, therefore, left as an exercise 
(Problem 5-1). 



(b) 


Fig. 5-2 Kuratowski’s second graph. 


Theorem 5-2 

Kuratowski’s second graph is also nonplanar. 

You may have noticed several properties common to the two graphs of 
Kuratowski. These are 

1. Both are regular graphs. 

fRecall that a graph in which all vertices are of equal degree is called a regular graph. 
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2. Both are nonplanar. 

3. Removal of one edge or a vertex makes each a planar graph. 

4. Kuratowski’s first graph is the nonplanar graph with the smallest 
number of vertices, and Kuratowski’s second graph is the nonplanar 
graph with the smallest number of edges. Thus both are the simplest 
nonplanar graphs. 

In the literature, Kuratowski’s first graph is usually denoted by K s and the 
second graph by K 33 —letter K being for Kuratowski. 


5-4. DIFFERENT REPRESENTATIONS OF A 
PLANAR GRAPH 

In following the proof of Theorem 5-1, it may have appeared that one’s 
ability to draw a planar graph in a plane depended on his ability to draw many 
crooked lines through devious routes. This is not the case. The following 
important and somewhat surprising result, due to Fary, tells us that there is 
no need to bend edges in drawing a planar graph to avoid edge intersections. 

Theorem 5-3 

Any simple planar graph can be embedded in a plane such that every edge is 
drawn as a straight line segment. 

Proof: The proof is involved and does not contribute much to the understanding 
of planarity. The interested reader is, therefore, referred to pages 74-77 in [1-2] 
or to the original paper of Fary [5-4]. As an illustration, the graph in Fig. 5-1 (d) 
can be redrawn using straight line segments to look like Fig. 5-3. In this theorem, 
it is necessary for the graph to be simple because a self-loop or one of two parallel 
edges cannot be drawn by a straight line segment. ■ 

Region: A plane representation of a graph divides the plane into regions 
(also called windows , faces, or meshes ), as shown in Fig. 5-4. A region is 





Fig. 5-3 Straight-line representation of 
the graph in Fig. 5-l(d). 
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Fig. 5-4 Plane representation (the num¬ 
bers stand for regions). 

characterized by the set of edges (or the set of vertices) forming its boundary. 
Note that a region is not defined in a nonplanar graph or even in a planar 
graph not embedded in a plane. For example, the geometric graph in Fig. 1-3 
does not have regions. Thus a region is a property of the specific plane 
representation of a graph and not of an abstract graph per se. 

Infinite Region: The portion of the plane lying outside a graph embedded 
in a plane, such as region 4 in Fig. 5-4, is infinite in its extent. Such a region is 
called the infinite, unbounded, outer, or exterior region for that particular plane 
representation. Like other regions, the infinite region is also characterized by 
a set of edges (or vertices). Clearly, by changing the embedding of a given 
planar graph, we can change the infinite region. For instance, Figs. 5-1(d) and 
5-3 are two different embeddings of the same graph. The finite region v x v 3 v 5 
in Fig. 5-1 (d) becomes the infinite region in Fig. 5-3. In fact, we shall shortly 
show that any region can be made the infinite region by proper embedding. 

Embedding on a Sphere: To eliminate the distinction between finite and 
infinite regions, a planar graph is often embedded in the surface of a sphere. 
It is accomplished by stereographic projection of a sphere on a plane. Put 
the sphere on the plane and call the point of contact SP (south pole). At point 
SP, draw a straight line perpendicular to the plane, and let the point where 
this line intersects the surface of the sphere be called NP (north pole). See 
Fig. 5-5. 

Now, corresponding to any point p on the plane, there exists a unique 
point p' on the sphere and vice versa, where p' is the point at which the 
straight line from point p to point NP intersects the surface of the sphere. 
Thus there is a one-to-one correspondence between the points of the sphere 
and the finite points on the plane, and points at infinity in the plane corre¬ 
spond to the point NP on the sphere. 

From this construction, it is clear that any graph that can be embedded in 
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Fig. 5-5 Stereographic projection. 


a plane (i.e., drawn on a plane such that its edges do not intersect) can also be 
embedded in the surface of the sphere, and vice versa. Hence 

Theorem 5-4 

A graph can be embedded in the surface of a sphere if and only if it can be 
embedded in a plane. 

A planar graph embedded in the surface of a sphere divides the surface 
into different regions. Each region on the sphere is finite, the infinite region on 
the plane having been mapped onto the region containing the point NP. Now 
it is clear that by suitably rotating the sphere we can make any specified 
region map onto the infinite region on the plane. From this we obtain 

Theorem 5-5 

A planar graph may be embedded in a plane such that any specified region 
(i.e., specified by the edges forming it) can be made the infinite region. 

Thinking in terms of the regions on the sphere, we see that there is no real 
difference between the infinite region and the finite regions on the plane. 
Therefore, when we talk of the regions in a plane regresentation of a graph, 
we include the infinite region. Also, since there is no essential difference 
between an embedding of a planar graph on a plane or on a sphere (a plane 
may be regarded as the surface of a sphere of infinitely large radius), the term 
“plane representation” of a graph is often used to include spherical as well as 
planar embedding. 

Euler's Formula: Since a planar graph may have different plane represen¬ 
tations, we may ask if the number of regions resulting from each embedding 
is the same. The answer is yes. Theorem 5-6, known as Euler’s formula, gives 
the number of regions in any planar graph. 
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Theorem 5-6 

A connected planar graph with n vertices and e edges has e — n + 2 regions. 

Proof: It will suffice to prove the theorem for a simple graph, because adding 
a self-loop or a parallel edge simply adds one region to the graph and simultane¬ 
ously increases the value of e by one. We can also disregard (i.e., remove) all edges 
that do not form boundaries of any region. Three such edges are shown in Fig. 5-4. 
Addition (or removal) of any such edge increases (or decreases) e by one and in¬ 
creases (or decreases) n by one, keeping the quantity e — n unaltered. 

Since any simple planar graph can have a plane representation such that each 
edge is a straight line (Theorem 5-3), any planar graph can be drawn such that 
each region is a polygon (a polygonal net). Let the polygonal net representing the 
given graph consist of/regions or faces, and let k p be the number of p-sided regions. 
Since each edge is on the boundary of exactly two regions, 

3 'k 3 + 4 'k± + 5 -k 5 4- • • • + r-k r = 2*c, (5-1) 

where k r is the number of polygons, with maximum edges. 

Also, 

k% + k$ + k 5 + • • • T k r = /. (5-2) 


The sum of all angles subtended at each vertex in the polygonal net is 

Inn. (5-3) 

Recalling that the sum of all interior angles of a p-sided polygon is n(p — 2), and 
the sum of the exterior angles is n(p + 2), let us compute the expression in (5-3) 
as the grand sum of all interior angles of / — 1 finite regions plus the sum of the 
exterior angles of the polygon defining the infinite region. This sum is 


n(3 - 2)-k 3 + tt(4 - 2 )-k A + • • • 

-)- n(r — 2 )-k r + An 

= n(2e -2 f) + 4n. 

(5-4) 

Equating (5-4) to (5-3), we get 


2n(e — f) + 4n = 

2nn, 

or e — f + 2 = 

n. 

Therefore, the number of regions is 


f = e — n + 2. 

m 

Corollary 


In any simple, connected planar graph with / regions, n vertices, and e edges 
( e > 2), the following inequalities must hold: 

. 3, 

& ^ 4 : 2^ 

(5-5) 

e < 3« — 6. 

(5-6) 
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Proof: Since each region is bounded by at least three edges and each edge 
belongs to exactly two regions, 

2e>3f 

or C > y/ 

Substituting for /from Euler’s formula in inequality (5-5), 

e > ~(e — n + 2) 

or e < 3n — 6. ■ 

Inequality (5-6) is often useful in finding out if a graph is nonplanar. For 
example, in the case of K 5 , the complete graph of five vertices [Fig. 5-1 (e)], 

n = 5, e — 10, 3n — 6 = 9 < e. 

Thus the graph violates inequality (5-6), and hence it is not planar. 

Incidentally, this is an alternative and independent proof of the non¬ 
planarity of Kuratowski’s first graph, as promised in Section 5-3. 

The reader must be warned that inequality (5-6) is only a necessary, but 
not a sufficient, condition for the planarity of a graph. In other words, 
although every simple planar graph must satisfy (5-6), the mere satisfaction 
of this inequality does not guarantee the planarity of a graph. For example, 
Kuratowski’s second graph, K 3>3 , satisfies (5-6), because 

e = 9, 

3n — 6 = 3*6 — 6 = 12. 

Yet the graph is nonplanar. 

To prove the nonplanarity of Kuratowski’s second graph, we make use of 
the additional fact that no region in this graph can be bounded with fewer 
than four edges. Hence, if this graph were planar, we would have 

2c > 4/, 

and, substituting for/from Euler’s formula, 

2e > 4(e — n + 2), 
or 2-9 > 4(9 - 6 + 2), 

or 18 > 20, a contradiction. 

Hence the graph cannot be planar. 

Plane Representation and Connectivity: In a disconnected graph the 
embedding of each component can be considered independently. Therefore, 
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it is clear that a disconnected graph is planar if and only if each of its com¬ 
ponents is planar. Similarly, in a separable (or 1-connected) graph the 
embedding of each block (i.e., maximal nonseparable subgraph) can be 
considered independently. Hence a separable graph is planar if and only if 
each of its blocks is planar. 

Therefore, in questions of embedding or planarity, one need consider only 
nonseparable graphs. 

Does a nonseparable planar graph G have a unique embedding on a 
sphere? Before answering this question, we must define the meaning of unique 
embedding. Two embeddings of a planar graph on spheres are not distinct if 
the embeddings can be made to coincide by suitably rotating one sphere with 
respect to the other and possibly distorting regions (without letting a vertex 
cross an edge). If of all possible embeddings on a sphere no two are distinct, 
the graph is said to have a unique embedding on a sphere (or a unique plane 
representation). 

For example, consider two embeddings of the same graph in Fig. 5-6. The 
embedding (b) has a region bounded with five edges, but embedding (a) has 
no region with five edges. Thus, rotating the two spheres on which (a) and (b) 
are embedded will not make them coincide. Hence the two embeddings are 
distinct, and the graph has no unique plane representation. 

On the other hand, the embeddings in Figs. 5-1 (d) and 5-3, when con¬ 
sidered on a sphere, can be made to coincide. (Remember that edges can be 
bent, and in a spherical embedding there is no infinite region.) Theorem 5-7, 
due to Whitney, tells us exactly when a graph is uniquely embeddable in a 
sphere. For a proof of the theorem, the reader is referred to [5-9]. 

Theorem 5-7 

The spherical embedding of every planar 3-connected graph is unique. 

This theorem plays a very important role in determining if a graph is 



Fig. 5-6 Two distinct plane representations of the same graph. 
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planar or not. The theorem states that a 3-connected graph, if it can be 
embedded at all, can be embedded in only one way. 

5-5. DETECTION OF PLANARITY 

How to tell if a given graph G is planar or nonplanar is an important 
problem, and “find out by drawing it” is obviously not a good answer. We 
must have some simple and efficient criterion. Toward that goal, we take the 
following simplifying steps: 

Elementary Reduction 

Step 1: Since a disconnected graph is planar if and only if each of its 
components is planar, we need consider only one component at a time. Also, 
a separable graph is planar if and only if each of its blocks is planar. There¬ 
fore, for the given arbitrary graph G, determine the set 

G = {<7,, G 2 , . . . , G k }, 

where each (7, is a nonseparable block of G. Then we have to test each G { for 
planarity. 

Step 2: Since addition or removal of self-loops does not affect planarity, 
remove all self-loops. 

Step 3: Since parallel edges also do not affect planarity, eliminate edges in 
parallel by removing all but one edge between every pair of vertices. 

Step 4: Elimination of a vertex of degree two by merging two edges in 
seriest does not affect planarity. Therefore, eliminate all edges in series. 

Repeated application of steps 3 and 4 will usually reduce a graph drasti¬ 
cally. For example, Fig. 5-7 illustrates the series-parallel reduction of the 
graph of Fig. 5-6(b). 

Let the nonseparable connected graph G t be reduced to a new graph 
after the repeated application of steps 3 and 4. What will graph H i look like? 
Theorem 5-8 has the answer. 

Theorem 5-8 
Graph //, is 

1. A single edge, or 

2. A complete graph of four vertices, or 

3. A nonseparable, simple graph with n > 5 and e > 7. 

fin a graph, two edges are said to be in series if they have exactly one vertex in common 
and if this vertex is of degree two. Edges es and et, (and also e\ and ei) are in series in 
Fig. 5-6. 
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(c) Series Reduced (d) Parallel Reduced 


Fig. 5-7 Series-parallel reduction of the graph in Fig. 5-6(b). 

Proof: The theorem can be proved by considering all connected nonseparable 
graphs of six edges or less. The proof is left as an exercise (Problem 5-9). 

In Theorem 5-8, all H t falling in categories 1 or 2 are planar and need not 
be checked further. 

From now on, therefore, we need to investigate only simple , connected , 
nonseparable graphs of at least five vertices and with every vertex of degree three 
or more. Next, we can check to see if e < 3/7 — 6. If this inequality is not 
satisfied, the graph H t is nonplanar. If the inequality is satisfied, we have to 
test the graph further and, with this, we come to Kuratowski’s theorem 
(Theorem 5-9), perhaps the most important result of this chapter. 

Homeomorphic Graphs: Two graphs are said to be homeomorphic if one 
graph can be obtained from the other by the creation of edges in series (i.e., 
by insertion of vertices of degree two) or by the merger of edges in series. The 
three graphs in Fig. 5-8 are homeomorphic to each other, for instance. A 
graph G is planar if and only if every graph that is homeomorphic to G is 
planar. (This is a restatement of series reduction, step 4 in this section.) 

Theorem 5-9 

A necessary and sufficient condition for a graph G to be planar is that G does 
not contain either of Kuratowski’s two graphs or any graph homeomorphic to 
either of them. 
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Fig. 5-8 Three graphs homeomorphic to each other. 

Proof: The necessary condition is clear, because a graph G cannot be embedded 
in a plane if G has a subgraph that cannot be embedded. That this condition is 
also sufficient is surprising, and its proof is involved. Several different proofs of 
the theorem have appeared since Kuratowski stated and proved it in 1930. For 
a complete proof of the theorem, the reader is referred to Harary [1-5], pages 108- 
112, Berge [1-1], pages 211-213, or Busacker and Saaty [1-2], pages 70-73. 

Note that it is not necessary for a nonplanar graph to have either of the 
Kuratowski graphs as a subgraph, as this theorem is sometimes misstated. 
The nonplanar graph may have a subgraph homeomorphic to a Kuratowski 
graph. For example, the graph in Fig. 5-9(a) is nonplanar, and yet it does not 
have either of the Kuratowski graphs as a subgraph. However, if we remove 





Fig. 5-9 Nonplanar graph with a subgraph homeomorphic to K^ > 3. 
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edges (a, x ) and (A, C ) from this graph, we get a subgraph, as shown in Fig. 
5-9(b). This subgraph is homeomorphic (merge two series edges at vertex x) to 
the one shown in Fig. 5-9(c). The graph of Fig. 5-9(c) clearly is isomorphic to 
K 3i3 , Kuratowski’s second graph, and this demonstrates the nonplanarity of 
the graph in Fig. 5-9(a). 

The example just discussed also points out that although Theorem 5-9 
(Kuratowski’s theorem) gives an elegant and simple-looking criterion for 
planarity of a graph, the theorem is difficult to apply in the actual testing of a 
large graph (say, a simple, nonseparable graph of 25 vertices, each of degree 
three or more). There have been several alternative characterizations of a 
planar graph. One of these characterizations, the existence of a dual graph, is 
the subject of the next two sections. 


5-6. GEOMETRIC DUAL 

Consider the plane representation of a graph in Fig. 5-10(a), with six 
regions or faces F,, F 2 , F 3 , F 4 , F s , and F 6 . Let us place six points p { ,p 2 ,..., 
p 6 , one in each of the regions, as shown in Fig. 5-10(b). Next let us join these 
six points according to the following procedure: 




(a) (b) 



Fig. 5-10 Construction of a dual graph. 
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If two regions F ( and Fj are adjacent (i.e., have a common edge), draw a 
line joining points p ( and pj that intersects the common edge between F t and 
Fj exactly once. If there is more than one edge common between F t and F p 
draw one line between points /?,. and Pj for each of the common edges. For an 
edge e lying entirely in one region, say F k , draw a self-loop at point p k inter¬ 
secting e exactly once. 

By this procedure we obtain a new graph G* [in broken lines in Fig. 
5- 10(c)J consisting of six vertices, /?,, p 2 ,. .. ,p 6 and of edges joining these 
vertices. Such a graph G * is called a dual (or strictly speaking, a geometric 
dual ) of G. 

Clearly, there is a one-to-one correspondence between the edges of graph 
G and its dual G* —one edge of G* intersecting one edge of G. Some simple 
observations that can be made about the relationship between a planar graph 
G and its dual G* are 

1. An edge forming a self-loop in G yields a pendant edget in G*. 

2. A pendant edge in G yields a self-loop in G*. 

3. Edges that are in series in G produce parallel edges in G*. 

4. Parallel edges in G produce edges in series in G*. 

5. Remarks 1-4 are the result of the general observation that the number 
of edges constituting the boundary of a region F t in G is equal to the 
degree of the corresponding vertex p { in G*, and vice versa. 

6. Graph G* is also embedded in the plane and is therefore planar. 

7. Considering the process of drawing a dual G * from G, it is evident that 
G is a dual of G* [see Fig. 5-10(c)]. Therefore, instead of calling G* 
a dual of G, we usually say that G and G* are dual graphs. 

8. If n, e, /, r, and p denote as usual the numbers of vertices, edges, 
regions, rank, and nullity of a connected planar graph G, and if n*, e*, 
/*, r*, and p* are the corresponding numbers in dual graph G*, then 

n* =/, 
e* = e, 
f* = n. 

Using the above relationship, one can immediately get 

r* = p, 
p* - r. 

Uniqueness of Dual Graphs: Is a (geometric) dual of a graph unique? In 

fAn edge incident on a pendant vertex is called a pendant edge. 
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(a) Dual of 5-6(a) (b) Dual of 5-6(b) 

Fig. 5-11 Duals of graphs in Fig. 5-6. 

other words, are all duals of a given graph isomorphic? From the method of 
constructing a dual, it is reasonable to expect that a planar graph G will have a 
unique dual if and only if it has a unique plane representation or unique 
embedding on a sphere. 

For instance, in Fig. 5-6 the same graph (isomorphic) had two distinct 
embeddings, (a) and (b). Consequently, the duals of these isomorphic graphs 
are nonisomorphic, as shown in Fig. 5-11. 

The graphs in Fig. 5-11, however, are 2-isomorphic. Theorem 5-10, stated 
without proof, is a generalization of this example. 

Theorem 5-10 

All duals of a planar graph G are 2-isomorphic; and every graph 2-isomorphic 
to a dual of G is also a dual of G. 

With this qualification in mind, it is quite appropriate to refer to a dual as 
the dual of a planar graph. 

Since a 3-connected planar graph has a unique embedding on a sphere, its 
dual must also be unique. In other words, all duals of a 3-connected graph 
are isomorphic. 


5-7. COMBINATORIAL DUAL 

So far we have defined and discussed duality of planar graphs in a purely 
geometric sense. The following provides us with an equivalent definition of 
duality independent of geometric notions. 

Theorem 5-11 

A necessary and sufficient condition for two planar graphs Gi and G 2 to be 
duals of each other is as follows: There is a one-to-one correspondence between 
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the edges in G x and the edges in G 2 such that a set of edges in G\ forms a circuit if 
and only if the corresponding set in G 2 forms a cut-set. 

Proof: Let us consider a plane representation of a planar graph G. Let us also 
draw (geometrically) a dual G* of G. Then consider an arbitrary circuit T in G. 
Clearly, T will form some closed simple curve in the plane representation of G — 
dividing the plane into two areas. (Jordan Curve Theorem). Thus the vertices of 
G* are partitioned into two nonempty, mutually exclusive subsets—one inside T 
and the other outside. In other words, the set of edges T* in G* corresponding to 
the set r in G is a cut-set in G*. (No proper subset of I"* will be a cut-set in G *; 
why?). Likewise it is apparent that corresponding to a cut-set S* in G* there is a 
unique circuit consisting of the corresponding edge-set S in G such that S is a 
circuit. This proves the necessity portion of Theorem 5-11. 

To prove the sufficiency, let G be a planar graph and let G' be a graph for which 
there is a one-to-one correspondence between the cut-sets of G and circuits of G\ 
and vice versa. Let G* be a dual graph of <7. There is a one-to-one correspondence 
between the circuits of G' and cut-sets of G, and also between the cut-sets of G and 
circuits of G*. Therefore there is a one-to-one correspondence between the circuits 
of G' and G*, implying that G' and G* are 2-isomorphic (Theorem 4-15). According 
to Theorem 5-10, G' must be a dual of G. ■ 

Dual of a Subgraph: Let G be a planar graph and G* be its dual. Let a be 
an edge in G , and the corresponding edge in G* be a*. Suppose that we 
delete edge a from G and then try to find the dual of G — a. If edge a was on 
the boundary of two regions, removal of a would merge these two regions 
into one. Thus the dual ( G — a)* can be obtained from G* by deleting the 
corresponding edge a* and then fusing the two end vertices of a* in G* — a*. 
On the other hand, if edge a is not on the boundary, a* forms a self-loop. In 
that case G* — a* is the same as (<7 — a)*. Thus if a graph G has a dual G*, 
the dual of any subgraph of G can be obtained by successive application of 
this procedure. 

Dual of a Homeomorphic Graph: Let G be a planar graph and G* be its 
dual. Let a be an edge in G, and the corresponding edge in G* be a*. Suppose 
that we create an additional vertex in G by introducing a vertex of degree two 
in edge a (i.e., a now becomes two edges in series). How will this addition 
affect the dual? It will simply add an edge parallel to a* in G*. Likewise, the 
reverse process of merging two edges in series (step 4 in Section 5-5) will 
simply eliminate one of the corresponding parallel edges in G*. Thus if a 
graph G has a dual <7*, the dual of any graph homeomorphic to G can be 
obtained from G* by the above procedure. 

So far we have been studying duality for planar graphs only. This was 
forced upon us because the very definition of duality depended on the graph 
being embedded in a plane. However, now that Theorem 5-11 provides us 
with an equivalent abstract definition of duality (namely, the correspondence 
between circuits and cut-sets), which does not depend on a plane represen¬ 
tation of a graph, we will see if the concept of duality can be extended to 



106 


PLANAR AND DUAL GRAPHS 


CHAP. 5 


nonplanar graphs also. In other words, given a nonplanar graph G, can we 
find another graph G' with one-to-one correspondence between their edges 
such that every circuit in G corresponds to a unique cut-set in G', and vice 
versa? The answer to this question is no, as shown in the following important 
theorem, due to Whitney. 

Theorem 5-12 

A graph has a dual if and only if it is planar. 

Proof: We need prove just the “only if” part. That is, we have only to prove 
that a nonplanar graph does not have a dual. Let G be a nonplanar graph. Then 
according to Kuratowski’s theorem, G contains K $ or K 3p3 or a graph homeomor- 
phic to either of these. We have already seen that a graph G can have a dual only 
if every subgraph g of G and every graph homeomorphic to g has a dual. Thus if 
we can show that neither K s nor K 3 3 has a dual, we have proved the theorem. 
This we shall prove by contradiction as follows: 

(a) Suppose that K 3 3 has a dual D. Observe that the cut-sets in K 3i3 corres¬ 
pond to circuits in D and vice versa (Theorem 5-10). Since K 3t3 has no cut-set 
consisting of two edges, D has no circuit consisting of two edges. That is, D con¬ 
tains no pair of parallel edges. Since every circuit in K 3t3 is of length four or six, 
D has no cut-set with less than four edges. Therefore, the degree of every vertex 
in D is at least four. As D has no parallel edges and the degree of every vertex is 
at least four, D must have at least five vertices each of degree four or more. That is, 
D must have at least (5 x 4)/2 = 10 edges. This is a contradiction, because ^ 3,3 
has nine edges and so must its dual. Thus K 3>3 cannot have a dual. Likewise, 

(b) Suppose that the graph K s has a dual H. Note that K 5 has (1) 10 edges, 
(2) no pair of parallel edges, (3) no cut-set with two edges, and (4) cut-sets with 
only four or six edges. Consequently, graph H must have (1)10 edges, (2) no vertex 
with degree less than three, (3) no pair of parallel edges, and (4) circuits of length 
four and six only. Now graph H contains a hexagon (a circuit of length six), and 
no more than three edges can be added to a hexagon without creating a circuit of 
length three or a pair of parallel edges [see Fig. 5-2(b)]. Since both of these are for¬ 
bidden in H and H has 10 edges, there must be at least seven vertices in H. The 
degree of each of these vertices is at least three. This leads to H having at least 
11 edges. A contradiction. ■ 

This proof of theorem 5-12 is not the one originally given by Whitney. 
Whitney’s proof, though more rigorous, is much more involved. Our proof is 
based on one given by Parson [5-7]. 

There is yet another equivalent combinatorial definition of duality, also 
given by Whitney and proved equivalent to the earlier two definitions [5-10]. 

Two planar graphs G and G* are said to be duals (or combinatorial duals) 
of each other if there is a one-to-one correspondence between the edges of G 
and G * such that if g is any subgraph of G and h is the corresponding subgraph 
of <7*, then 

rank of ( G * — h) = rank of G* — nullity of g. (5-7) 

This relationship is shown diagrammatically in Fig. 5-12. 
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Rank of (G* - h) = Rank of G* - Nullity ofg 
Fig. 5-12 Combinatorial duals. 

As an example, consider the graph in Fig. 5-6(a) and its dual in Fig. 
5-11(a). Take the subgraph {e 4 , e 5 , e 6 , e 7 ] in Fig. 5-6(a) and the corresponding 
subgraph {ef, ef, ef, ef] in Fig. 5-11(a). 

rank of ( G * — [ef, ef, ef, ef}) = rank of {ef, e*, ef] = 2, 

rank of G* = 3, 

nullity of (e 4 , e 5 , e 6 , e n } = 1, 
and 

2 = 3-1. 

Clearly, this definition is also independent of the geometric connotation. 
It is therefore often preferred for proving results in purely algebraic fashion. 
However, in deciding whether or not two given graphs are dual the combina¬ 
torial definitions are difficult to use. 

The proof of equivalence of combinatorial and geometric duals is quite 
involved. The interested reader is referred to the original papers of Whitney 
[5-10, 5-12] or to Seshu and Reed [1-13], pages 45-50. Since the geometric and 
combinatorial duals are one and the same, we simply refer to them as the 
dual, rather than the geometric or combinatorial dual. 

Self-Dual Graphs: If a planar graph G is isomorphic to its own dual, it 
is called a self-dual graph. It can be easily shown that the four-vertex com¬ 
plete graph is a self-dual graph (Problem 5-20). Self-dual graphs have 
interesting properties and pose some unsolved problems. 

5-8. MORE ON CRITERIA OF PLANARITY 

Theorems 5-9 (Kuratowski’s theorem) and 5-12 (Whitney’s theorem) 
provided us with two different and alternative ways of characterizing a planar 
graph. The third classic planarity criterion, due to MacLane [5-6], is given 
next. 

Set of Basic Circuits: A set C of circuits in a graph is said to be a complete 
set of basic circuits if (i) every circuit in the graph can be expressed as a rin? 
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sum of some or all circuits in C, and (ii) no circuit in C can be expressed as 
a ring sum of others in C. The significance of complete sets of basic circuits 
will be clearer in Chapter 6, in relation to the vector space of a graph. It may, 
however, be mentioned here that whereas a set of fundamental circuits (as 
defined in Chapter 3 with respect to a spanning tree) always constitutes a 
complete set of basic circuits, the converse does not hold for all graphs 
(Problem 5-15). 

In a planar graph a complete set of basic circuits has an additional pro¬ 
perty, which we will observe next. 

In a plane representation of a planar, connected graph G the set of circuits 
forming the interior regions constitutes a complete set of basic circuits. For 
any circuit T in G can be expressed as the ring sum of the circuits defining the 
regions contained in T. Observe that every edge appears in at most two of 
these basic circuits. Thus for every planar graph G we can find a complete 
set of basic circuits such that no edge appears in more than two of these 
basic circuits. This result and its converse (proof of which can be found in 
[5-6]) lead to another well-known characterization of planar graphs. 

Theorem 5-13 

A graph C is planar if and only if there exists a complete set of basic circuits 
(i.e., all jli of them, jj, being the nullity of G ) such that no edge appears in more 
than two of these circuits. 

All three of these classic characterizations suffer from two shortcomings. 
First, they are extremely difficult to implement for a large graph. Second, in 
case the graph is planar they do not give a plane representation of the graph. 

These drawbacks have prompted recent discoveries of several map- 
construction methods, where the testing of planarity itself is based on an 
attempt to produce a plane representation of the graph. One such method is 
given by Tutte [5-9]. Several other construction methods, some of them quite 
similar, have been implemented on digital computers [5-2, 5-8], In most of 
these methods, the given graph is first reduced to one or more simple , non- 
separ able graphs with every vertex of degree three or more and with e < 3n — 6. 
Then the construction algorithm is applied such that either one succeeds in 
obtaining a planar realization of the graph or the graph is nonplanar. More 
will be said on such algorithms in Chapter 11. 

Some algorithms are better than others, but all are laborious and time- 
consuming. The search for a simple, elegant, and practical characterization of 
a planar graph is far from over. 


5-9. THICKNESS AND CROSSINGS 

Having found that a given graph G is nonplanar, it is natural to ask, what 
is the minimum number of planes necessary for embedding G? The least 



CHAP. 5 


SUMMARY 


109 


number of planar subgraphs whose union is the given graph G is called the 
thickness of G. In a printed-circuit board, for instance, the number of 
insulation layers necessary is the thickness of the corresponding graph. 

By definition, then, the thickness of a planar graph- is one. The thickness 
of each of Kuratowski’s graphs is clearly two. The reader can show, by 
sketching them, that the thickness of the complete graph of eight vertices is 
two, while the thickness of the complete graph of nine vertices is three 
(Problem 5-19). Although there are several results available on the thickness 
of special types of graphs [1-5, pages 120-121], the thickness of an arbitrary 
graph is in general, difficult to determine. 

Another question one might ask about a nonplanar graph is: What is the 
fewest number of crossings (or intersections) necessary in order to “draw” the 
graph in a plane? 

The crossing number of a planar graph is, by definition, zero, and of 
either of Kuratowski’s graphs, it is one. The crossing numbers of only a few 
graphs have been determined. No formula exists to give the crossing number 
of an arbitrary graph. 


SUMMARY 

Can a given graph be placed in a plane without its edges crossing over? 
This is clearly a geometric question about the graph—an object that exists in 
two different worlds, purely combinatorial and purely geometric. To quote 
Harary [1-5], page 106, “one of the most fascinating areas of study ... is the 
interplay between considering a graph as a combinatorial object and as a 
geometric figure.” 

On probing a bit further, we discovered that we needed to investigate only 
simple, nonseparable graphs which have no vertex of degree less than three. 
Moreover, we found that any graph with the number of edges e > 3n — 6 
need not be investigated any further, because such a graph is nonplanar. 

Three equivalent, but very different, planarity characterizations, those of 
Kuratowski, Whitney, and MacLane, were presented and their significance 
and drawbacks discussed. For graphs that are nonplanar, additional relevant 
properties, such as thickness and number of crossings, were defined and 
discussed. There are many unsolved problems in this field of study. Because of 
the current interest in such areas as automatic wiring of complex systems, 
technology of printed circuits, and design of large-scale integrated circuits, 
these geometrical properties of graphs are of practical importance. 

The existence of a dual graph, in addition to being a condition equivalent 
to that of planarity, is important in its own right. The underlying structural 
relationship between dual graphs becomes very clear in terms of the vector 
space of the graph, a subject for the next chapter. 
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PROBLEMS 

5-1. Using geometric arguments similar to those used in proving Theorem 5-1, prove 
that Kuratowski’s second graph is also nonplanar. 

5-2. If every region of a simple planar graph (with n vertices and e edges) embedded in 
a plane is bounded by k edges, show that 

c - ~ 2) 
k - 2 

5-3. A simple planar graph to which no edge can be added without destroying its 
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planarity (while keeping the graph simple, of course) is called a maximal planar 
graph. Prove that every region in a maximal planar graph is a triangle. 

5-4. Prove that a planar graph of n vertices {n > 4) has at least four vertices with degree 
five or less. This will also prove that there are no 6-connected planar graphs. 
{Hint: Use the result of Problem 5-3.) 

5-5. A planar graph G is said to be completely regular if the degrees of all vertices of G 
are equal and every region is bounded by the same number of edges. The graphs 
in Figs. 2-20(a) and 2-21(b) are completely regular, for example. Show that there 
are only five possible simple completely regular planar graphs, excluding the trivial 
graphs with degree < 2. Sketch them. {Hint: Use Euler’s formula.) 

5-6. Prove that an infinite pattern formed of a regular polygon repeating itself, such as 
those found in mosaics and tiled floors (see infinite graphs in Fig. 1-10), can consist 
of only three types of polygons—square, triangular, and hexagonal. 

5-7. Redraw the graph in Fig. 5-4 such that region 2 becomes the infinite region. 

5-8. Using Kuratowski’s theorem, show that the graphs in Fig. 2-3 (known as Petersen’s 

graph) are nonplanar. 

5-9. By sketching all (don’t panic, their number is small) simple, nonseparable graphs 
with n < 4 and e < 6, prove Theorem 5-8. 

5-10. Draw the geometric dual of the graph in Fig. 5-4. 

5-11. Show by actual construction that the geometric dual of the two (2-isomorphic) 
graphs in Figs. 4-11(a) and (d) are isomorphic. 

5-12. Construct an example to demonstrate that G**, the dual of a dual of a graph G, 
may not be isomorphic to G, but is 2-isomorphic to it. 

5-13. Prove that the geometric dual of a self-loop-free nonseparable planar graph is also 
nonseparable. 

5-14. Prove that a self-loop-free planar graph is 2-connected if and only if its dual is also 
2-connected. 

5-15. Give an example of a graph which has at least one complete set of basic circuits 
not constituting a set of fundamental circuits (with respect to any spanning tree). 

5-16. Show that the edges forming a spanning tree in a planar graph G correspond to 
the edges forming a set of chords in the dual G*. 

5-17. Show that a set of fundamental circuits in a planar graph G corresponds to a set 
of fundamental cut-sets in its dual G*. 

5-18. Determine the number of crossings and the thickness of the graph in Fig. 2-3. 

5-19. Show, by sketching, that the thickness of the eight-vertex complete graph is two, 
whereas that of the nine-vertex complete graph is three. 

5-20. Show that the complete graph of four vertices is self-dual. Give another example 
of a self-dual graph. 




VECTOR SPACES OF A GRAPH 


Modern abstract algebra is a powerful tool in the theory as well as in the 
applications of graphs. It is essential for a thorough understanding of graphs 
and a must for those wishing to do research in the field. Moreover, since 
digital computers do not (at least internally) work on pictorial graphs, it is 
necessary to represent a graph algebraically and to manipulate it algebrai¬ 
cally, if one wishes to enlist the aid of a computer in solving graph-theory 
problems. 


6-1. SETS WITH ONE OPERATION 

Set: A set is a collection of objects (called the elements of the set). 

Note that there is no specification on the nature of the elements or the 
number of elements. Nor do the elements have anything to do with each other, 
except belong to the same set. Braces are used to enclose the elements of a 
set. For instance, a set S' consisting of five objects a , b, c , x, and y may be 
written as S = [a, b, c , x, y}. Since the order in which these elements appear 
is of no significance, we could have written the same set as S == {x, b, a , y, c}, 
for instance. The symbol a e S is used to indicate that element a is in set 
S. 

A subset S' of a set S is a collection of some of the elements of S. If S has 
at least one element that is not in S', then S' is called a proper subset of S. The 
empty set or null set, written 0 , has no element in it and is considered a 
subset of every set. The two most common combinations of sets are the union 
U and intersection n, defined as 
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S 1 U S 2 = S 3 , a set containing all the elements of 5", and S 2 , 

S { n S 2 = a set containing only those elements that are both in S t 

and in S 2 . 

In this chapter we shall be concerned with the combination of two 
elements within a set rather than the combination of two different sets. 

Operation: Let us introduce a rule of combination called binary operation 
(also called binary composition , law of composition , or internal law of composi¬ 
tion) between two elements of a set. Addition, multiplication, subtraction, 
and division are some of the familiar binary operations between two elements 
in a set of numbers. To keep the binary operation general enough, we shall 
use the symbol * (rather than using -f, —, x, -F, etc.) to denote the binary 
operation. A set with operations defined on it is called an algebraic system or 
just algebra. 

Special Types of Algebras: Now we have a set, say S = {a, b, c, . . .], and 
a binary operation * (written as a * b) between the elements of S. Depending 
on the nature of the binary operation *, set S can be classified as one of 
several special types of algebras. For instance, if * satisfies postulates 1 and 2 
below, set S is called a semigroup : 

1. Closure: If a and b are in S, then a * b is also in S. 

2. Associative: If the elements a , b , and c are in S, then (a * b) * c = 
a * (b * c ). 

Semigroups have many interesting properties and have been studied in 
great detail. In fact, there are several thick books written on the theory of 
semigroups. But since semigroups as such are not applicable to the business 
at hand, we shall move on to more specialized semigroups. 

A semigroup that satisfies postulate 3, below, is called a monoid. 

3. Identity element: There exists a unique element e in S' such that for any 
element x in S, x * e = e * x x. 

A monoid that satisfies postulate 4, below, is called a group. 

4. Inverse: For every element x in S there exists a unique element x' in 5" 
such that x * x' = x' * x — e. Element x' is called the inverse of x, 
with respect to operation *. 

A semigroup that satisfies postulate 5, below, is called an abelian semi¬ 
group or commutative semigroup. 

5. Commutative: If a and b are in S, then a * b = b * a. 
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Fig. 6-1 Algebraic systems with one internal operation. 

If an abelian semigroup also has an identity element, it is called an 
abelian monoid (or an abelian semigroup with identity element ). 

A set S with an operation * that satisfies all these five postulates is called 
an abelian group (or a commutative group). 

Figure 6-1 summarizes the definitions of these “algebraic systems” and 
shows the relationships among them. The arrows point toward the direction 
of increasing restriction on the set S'. The number next to a line indicates the 
particular postulate that converts one algebraic system into another. 

It ought to be mentioned here that an algebraic system in which the binary 
operation does not satisfy even the closure and associative rules is of little 
mathematical interest. Another observation that may be made is that postu¬ 
late 4 cannot be satisfied before 3. In view of these two remarks, Fig. 6-1 does 
show all possible combinations of the five postulates. 

Examples: Some examples are in order now. 

Consider the set of all positive integers, S t = {1, 2, 3, . *.}. Set S x satisfies 
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closure and associative rules if the binary operation * is the ordinary addition 
operation +. Moreover, it also satisfies the commutative requirement. Hence 

under addition is a commutative semigroup. Note that in there is no 
identity element (an element when added to any other element results in the 
latter element). 

Consider the same set S { = (1, 2, 3, . . .} under the ordinary division 
operation -i-. Since S’, contains no fractions, clearly S { does not satisfy the 
closure rule, and hence is not a semigroup. 

Again, the same set S l under multiplication • is an abelian monoid, because 
it has an identity element, 1. The set S { , however, is not a group under the 
multiplication operation because S’, does not have the inverse of every element 
(because 5’ 1 has no fractions). 

The set of all integers S 2 = {. . ., —3, —2, — 1, 0, 1, 2, 3,. . .} is an 
abelian group under the addition operation (hence an additive abelian group). 

The reader can verify (Problem 6-2) that the set consisting of the four 
fourth roots of unity, which is [1, — 1, /, —/} (where i — J— 1), is an abelian 
group under the multiplication operation (therefore, a multiplicative abelian 
group). 

Groups of Subgraphs: Now we shall show that sets of certain subgraphs of 
any given graph G satisfy the preceding postulates and thus form their groups. 
These are very fundamental and important results in graph theory. 

Theorem 6-1 

The ring sum of two circuits in a graph G is either a circuit or an edge-disjoint 
union of circuits. 

Proof: Let T 2 and T 2 be any two circuits in a graph G. If the two circuits have 
no edges or vertices in common, their ring sum 0 T 2 is a disconnected sub¬ 
graph of G, and is obviously an edge-disjoint union of circuits. If, on the other 
hand, Tj and T 2 do have edges and/or vertices in common, we have the following 
possible situations: 

Since the degree of every vertex in a graph that is a circuit is two, every vertex 
v in subgraph r t ® T 2 has degree d(v), where 

d(v) =2 if v is in Tj only, or in T 2 only; or if one of the 

edges formerly incident on v was in both F , and F 2 ; or 
d(v) =4 if Tj and T 2 just intersect at v (without a common edge). 

There is no other type of vertex in Tj 0 T 2 . Thus T, 0T 2 is an Euler graph, 
and therefore consists of either a circuit or an edge-disjoint union of circuits 
(Theorem 2-6). ■ 

It is immediate from Theorem 6-1 that the ring sum of any two edge- 
disjoint unions of circuits is also a circuit or another edge-disjoint union of 
circuits. 
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Theorem 6-2 

The set consisting of all the circuits and the edge-disjoint unions of circuits 
(including the null set 0 ) in a graph G is an abelian group under the ring-sum op¬ 
eration ®. 

Proof: It is required to prove that this set under the operation ® satisfies 
postulates 1-5 in this section. That the closure postulate is satisfied has just been 
proved in Theorem 6-1. Associative and commutative postulates are also clearly 
satisfied. The null graph serves as the identity element 0 , because 0 ® g — g, 
for any subgraph g of G. What about the inverse? 

A circuit or an edge-disjoint union of circuits T is its own inverse, because 

r©r- 0. 

Hence the theorem. ■ 

Theorem 6-3 

The set consisting of all the cut-sets and the edge-disjoint unions of cut-sets 
(including the null set 0 ) in a graph G is an abelian group under the ring sum 
operation. 

Proof: It is follows from Theorem 4-4 that this set satisfies the closure axiom. 
Associativity and commutativity are also immediately apparent. And so is the 
existence of the identity element 0 . Just as in the case of circuits, a cut-set or an 
edge-disjoint union of cut-sets is its own inverse. Thus the theorem. ■ 


6-2. SETS WITH TWO OPERATIONS 

Now suppose that on the elements of an abelian group we impose another 
binary operation Q, in addition to the operation * imposed in Section 6-1. The 
five postulates on O can be written as follows (note that these are the same 
postulates as in Section 6-1, but they are for a different binary operation 0) ; 

6. Closure: If a and b are in S, then a O b is also in S. 

7. Associative: If a, b, and c are in S, then (a Q b) (f) c = a Q (b (f) c). 

8. Identity element: There exists a unique element i in S such that for any 
element x in S, x Q i = i Q x = x. This element i is called the iden¬ 
tity element (or unity) with respect to operation O. 

9. Inverse: For every element (except for the identity element e of pos¬ 
tulate 3 in Section 6-1) x in S, there exists a unique element x~ l in S 
such that x (f) x~ l = x~ 1 O x — i. Element x ~ 1 is called the inverse of 
x, with respect to operation Q- 

10. Commutative: If a and b are in S, then a O b = b O a. 


And to relate these two different binary operations, postulate 11 is introduced. 
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11. Distributive: The operation Q ls distributive with respect to the 
operation *; that is, for elements a, b, and c in S 

aQ(b*c)^aQb*aQc, 
and (b*c)Qa = bQa*cQa. 

Just as in Section 6-1, the different combinations of these postulates, in 
addition to postulates 1-5, will render different types of algebraic systems. 
These are 

Ring: An abelian group with respect to * that satisfies postulates 6, 7, and 
11 is called a ring. 

Ring with Unity: A ring that has a unity or identity element i with respect 
to the second operation 0 . 

Commutative Ring: A ring that satisfies the commutative postualate (10) 
with respect to O. 

Commutative Ring with Unity: A commutative ring that has an identity 
element (8) with respect to Q. 

Division Ring (or Skew Field or S-Field): A ring with unity that also sat¬ 
isfies the inverse postulate (9) with respect to Q. 

Field (sometimes called Commutative Field): A division ring that satisfies 
the commutative postulate (10) with respect to Q. Thus a field satisfies all 
eleven postulates, and therefore may be regarded as the “strongest” algebraic 
system considered here. 

The relationship among these algebraic systems is summarized in Fig. 6-2. 

Examples: As mentioned in Section 6-1, the set of all integers 

S 2 = {...,-3,-2,-1,0, 1,2, 3,...} 

is an abelian group under +, the usual addition operation. Moreover, ordi¬ 
nary multiplication between elements of S 2 also satisfies the closure, associa¬ 
tive, distributive, and commutative postulates, and there is a unity element, 1, 
in ,S 2 . Thus S 2 is a commutative ring with unity. However, since S 2 does not 
contain fractions, it does not satisfy postulate 9, and hence S 2 is not a field. 

The set of all rational numbers does satisfy postulate 9, in addition to the 
other ten satisfied by S 2 . Therefore, the set of all rational numbers is a field 
under addition and multiplication. The set of all real numbers also forms a 
field under addition and multiplication. All complex numbers also form a field 
under the usual addition and multiplication. 

In this book we shall mainly be concerned with groups and fields. The 
rest of the algebraic systems are defined simply for your general interest. 
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Fig. 6-2 Algebraic systems with two internal operations. 


6-3. MODULAR ARITHMETIC AND 
GALOIS FIELDS 

Consider a system of numbers that has only three numbers in it, ordinary 
0, 1, and 2. And let the rules for addition and multiplication in this system be 
the same as ordinary addition and multiplication with the following exception: 
If a number q (resulting from addition or multiplication operations) equals or 
exceeds 3, it is to be divided by 3, the quotient is discarded, and the remainder 
is used in place of q. The addition and multiplication tables for such a number 
system are given in Fig. 6-3, and are called addition modulo 3 and multiplica¬ 
tion modulo 3. Together they are called modulo 3 arithmetic. For example, in 
modulo 3 arithmetic, 

1 + 14-2.2+1+2+1 = 1 (mod 3). 

Similarly, we can define any modulo m arithmetic system consisting of m 
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Fig. 6-3 Addition and multiplication tables for arithmetic modulo 3. 

elements 0, 1,2 , ... ,m — 1 and the relationship for any q > m — 1: 

q — rri'p + r = r (mod m) and r < m. 

It is suggested that the reader write down arithmetic tables for m = 4, 5, 6, 
and 7 (Problem 6-7). 

Finite Fields: From the tables in Fig. 6-3, it can be verified that the set 
(0, 1, 2} with addition and multiplication modulo 3 is a field. There is an 
identity 0 with respect to modulo 3 addition, and an identity 1 with respect to 
modulo 3 multiplication. Every element has a unique additive inverse, and 
every element other than 0 has a multiplicative inverse. 

By means of actual tables, like those in Fig. 6-3, it can be easily verfied 
that modulo 2, 5, and 7 systems are also fields. On the other hand, the set 
[0,1,2, 3} with modulo 4 addition and multiplication is not a field, because no 
inverse of 2 exists with respect to modulo 4 multiplication (Problem 6-8). 

In fact, it turns out that every finite set 

Z m = {0, 1,2,. . . , m — 1} 

with modulo m addition and multiplication is a field if and only if m is a 
prime number. Such a field is called a Galois field modulo m, or GF(m). 

As we shall see shortly, in representing graphs we are concerned only with 
GF(2), Galois field modulo 2. It consists of [0, 1} and the addition modulo 2 
and multiplication modulo 2 operations. The two arithmetic tables are given 
in Fig. 6-4. (Those familiar with computer logic will readily recognize that in 
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Fig. 6-4 Addition and multiplication tables of GF(2). 



120 


VECTOR SPACES OF A GRAPH 


CHAP. 6 


Fig. 6-4, 4 - is the same as “EXCLUSIVE OR” and • is the same as “AND” 
of Boolean logic.) 


6-4. VECTORS AND VECTOR SPACES 

In an ordinary two-dimensional (Euclidean) plane, a point is represented 
by an ordered pair of numbers X = (x,, x 2 ). Point X can also be regarded as 
a vector emanating from the origin 0 = (0, 0) to the point (x,, x 2 ). Similarly, 
in three-dimensional Euclidean space the triplet (7,2.1, —3) represents a 
vector. Sometimes, instead of row notation a column notation is used, for 
example, 



The three components 7, 2.1, and —3 in the example above are from the 
field of real numbers. Every point (of the infinitely many points) in E 3 , the 
three-dimensional Euclidean space, corresponds to a unique ordered triplet 
(of the infinitely many triplets) consisting of three real numbers. 

Now suppose that we are working with GF(2), the field of integers modulo 
2. Then every number in a triplet can only be either 0 or 1. Thus there are only 
eight (2 3 = 8) vectors possible (instead of infinitely many as in the real 
number system) in a three-dimensional space if our numbers are restricted to 
GF(2). These are 


( 0 , 0 , 0 ), ( 1 , 0 , 0 ), ( 0 , 1 , 0 ), ( 0 , 0 , 1 ), ( 1 , 1 , 0 ), ( 1 , 0 , 1 ), ( 0 , 1 , 1 ), ( 1 , 1 , 1 ). 


This concept of representing vectors can be extended to representation of 
a vector in k-dimensional space by means of an ordered k-tuple. For instance, 
the 7-tuple (0, 1, 1,0, 1,0, 1,) represents a vector in a seven-dimensional 
vector space over the field GF(2). 

The numbers in a field are sometimes called scalars (to distinguish them 
from vectors). The scalars in the field GF(2) are 0 and 1. 

A vector space, in addition to being made up of k-tuples (from some 
specified field), must satisfy certain other conditions regarding combinations 
of two vectors, or operation of a vector with a scalar, and the like. These can 
be summarized in the following definition. 


Definition 

A k-dimensional vector space (or a linear vector space) over the field F, is an 
object consisting of 

1. A field F (with its set of elements S, and two operations * and O)- 
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2. A set W of ^-tuples (all numbers taken from F). 

3. A binary operation [+] (called vector sum ) between the elements of the set 
W, such that W is an abelian group under this operation (+]. 

4. A binary operation 0 (called scalar multiplication), which when applied 
between any scalar c in Fand a vector X = (x l5 x 2 , .. ., x fc ) in W produces 
another vector P in W. P is called the scalar product of c and X, and is 
given by 

P = C 0 X = (C O * 1 , C O X 2 , . . . , c O x k ). 

Furthermore, scalar multiplication satisfies the following: 

Ci 0 (c 2 0 X) = (cj O c 2 ) O X, where c,,c 2 e F, 
ci 0(Xffl Y ) - c, 0X00, 0 Y, 

(c i * c 2 ) 0 X — Ci 0 X PF1 c 2 0 X, 

1 0 X = X, where 1 is the identity with 
respect to operation O i n F- 

Let us now leave the general vector space, and concern ourselves with the 
specific vector space associated with a graph G. 


6-5. VECTOR SPACE ASSOCIATED WITH 
A GRAPH 


Let us consider the graph G in Fig. 6-5 with four vertices and five edges 
c,, e 2 , e 3 , e 4 , e 5 . Any subset of these five edges (i.e., any subgraph g) of G can 
be represented by a 5-tuple: 


such that 


X (x,, x 2 , x 3 , x 4 , x 5 ) 


x { = 1 if e t is in g and 

x, = 0 if e t is not in g. 


For instance, the subgraph g, in Fig. 6-5 will be represented by (1, 0, 1, 0, 1). 





Fig. 6-5 Graph and two of its subgraphs. 
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Altogether there are 2 5 or 32 such 5-tuples possible, including the zero 
vector 0 = (0, 0, 0, 0, 0), which represents a null graph,t and (1, 1, 1, 1, 1), 
which is G itself. 

It is not difficult to see that the ring-sum operation between two subgraphs 
corresponds to the modulo 2 addition between the two 5-tuples representing 
the two subgraphs. For example, consider two subgraphs 

g, = {e x , e 3 , c 5 ] represented by (1,0, 1, 0, 1), and 

§2 = (<+ e 3 , e 4 ] represented by (0, 1, 1, 1,0). 

The ring sum 


g, © £2 = (+, <+> represented by (1, 1,0, 1, 1), 

which is clearly modulo 2 addition of the 5-tuples for g, and g 2 . 

Now, generalizing this example, we can make the most important observa¬ 
tion of this chapter: There is a vector space W G associated with every graph 
G, and this vector space consists of 

1. Galois field modulo 2; that is, set {1,0} with operation addition modulo 
2 written as + such that 0 + 0 = 0, 1+0=1 =0+1,1 + 1=0, 
and multiplication modulo 2 written as • such that 0*0 = 0= 1*0 = 
0* 1, and 1*1 = 1. 

2. 2 e vectors (c-tuples), where e is the number of edges in G. 

3. An addition operation between two vectors X, Y in this space, defined 
as the vector sumt 

X © Y = (x, + y x , x 2 + y 2 , . . ., x, + y e ), 

+ being addition modulo 2. 

4. And a scalar multiplication between a scalar c in Z 2 and a vector X, 
defined as c*X = (ox,, . . . , ox e ). 

The reader can verify that the vector space W G associated with a graph G, 
as defined above, does indeed satisfy all the requirements of a vector space 

tin considering vector spaces of graphs, isolated vertices are of no consequence. 
Hence a null graph of four vertices is not distinguished from a null graph of 100 vertices. 

JThe same symbol ® has been used for the ring sum of two subgraphs, as well as for 
the vector sum between the two vectors representing the two subgraphs. This is done as 
much to eliminate an extra symbol as to remind the reader that a ring sum between two 
subgraphs amounts to the same thing as vector sum of the corresponding vectors. There 
will be no occasion for ambiguity. 
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(Problem 6-11). Note that the identity element (for the vector sum operation) 
in a vector space is 0, the zero vector. 

6-6. BASIS VECTORS OF A GRAPH 

Linear Dependence: A set of vectors X,, X 2 , . . . , X r (over some field F) is 
said to be linearly independent if for scalars c,, c 2 , . . ., c r in Fthe expression 

c,Xi + c 2 X 2 + • • • + c r X r — 0 

holds only if c, = c 2 = • • • = c r = 0. Otherwise, the set of vectors is said to 
be linearly dependent. For example, consider the set of three vectors, over the 
field of real numbers: 



An arbitrary linear combination of these three vectors set to zero gives 

' c, \ / 0 \ /3 c 3 \ c, + 3c 3 \ /On 

CjXi + c 2 X 2 + c 3 X 3 — 4c, -T I c 2 + [ 0 I = 4c, + c 2 — 0 • 

\ 0 \2cJ \ 0 / \ 2c J \o) 

That is, 2c 2 = 0, 4c, + c 2 == 0, and c, + 3c 3 = 0, which hold only if 
c, = c 2 = c 3 = 0. Thus the set of vectors (X,, X 2 , X 3 ] is linearly independent. 

On the other hand, consider another set of vectors (over the same field of 
real numbers): 



Setting an arbitrary linear combination of these vectors to zero, 

/ c 5 + .5c 6 \ /ON 

c 4 X 4 + c 5 X 5 + c 6 X 6 = j 2c 4 + 2cj I = 0 , 

\—2c 4 + cj \0y 

gives c 4 = — c 5 = .5c 6 = a, where a can be any real number not necessarily 
zero. Therefore, the set [X 4 , X 5 , X 6 ) is linearly dependent. 

Basis Vectors: To the set of three linearly independent vectors (X,, X 2 , X 3 ] 
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in the first example, let us add another vector 


Y = 


fa 


\yj 


Now you can show without much difficulty that the set {Xj,X 2 , X 3 , Y] is 
linearly dependent regardless of what Y is. In other words, you can find a set 
of four real numbers a , b, c, and d (not all of which are zero) such thatf 


aX, + bX 2 + cX 3 + d Y = 0. 

Rewriting Eq. (6-1), 


Y = 



(6-1) 


Thus a vector Y can be expressed as a linear combination of the vectors X 1? 
X 2 , X 3 . Such a set of k linearly independent vectors is called a basis (or the 
coordinate system ) in the vector space. More formally: 

If every vector in a vector space W can be expressed as a linear combina¬ 
tion of a given set of vectors, this set is said to span the vector space W. The 
dimension of the vector space W is the minimal number of linearly independent 
vectors required to span W. Any set of k linearly independent vectors that 
spans W, a ^-dimensional vector space, is called a basis for the vector space W. 

For example, the following set of k unit vectors in a /c-dimensional vector 
space is a basis. This is the most commonly used basis, and is often called the 
natural or standard basis. 


I l \ 

0 

0 


CJ 



l°\ 

o 

0 



\:i 


It is clear that any vector in the A:-dimensional vector space (over the field of 
real numbers) can be expressed as a linear combination of these k vectors. 

Basis Vectors of a Graph: In Section 6-5 it was shown that there was a 


tOne possible solution (out of infinitely many) that satisfies Eq. (6-1) is 

H^ 2 )- »—($)• 
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vector space W G associated with every graph G. Corresponding to each sub¬ 
graph of G there was a vector in W G , represented by an e-tuple. The natural 
basis for this vector space W G is a set of e linearly independent vectors, each 
representing a subgraph consisting of one edge of G. For instance, for the 
graph in Fig. 6-5, the set of the following five vectors serves as a basis for W G . 

( 1 , 0 , 0 , 0 , 0 ), 

( 0 , 1 , 0 , 0 , 0 ), 

( 0 , 0 , 1 , 0 , 0 ), 

( 0 , 0 , 0 , 1 , 0 ), 

( 0 , 0 , 0 , 0 , 1 ). 

Any of the possible 32 subgraphs (including G as well as the null graph) can 
be represented by a suitable (and unique) linear combination of these five 
basic vectors. 


6-7. CIRCUIT AND CUT-SET SUBSPACES 

A nonempty subset of vectors in a space is called a subspace if the subset 
satisfies the axioms of a vector space. To check whether a given subset of 
vectors is a subspace we have only to check for closure under scalar multi¬ 
plication and vector addition. Since the scalar product of 0 and a vector X is 
the zero vector 0, the closure under scalar multiplication assures the presence 
of 0. Closure under scalar multiplication also assures the inverse of every 
vector [because the inverse of vector X is the vector (— 1) • X]. If the associative, 
commutative, and distributive axioms hold in the original space, they must 
also hold for every subset of vectors. Thus a subset of vectors closed under 
vector addition and multiplication by scalars is a subspace. 

A vector space is trivially its own subspace. The null space, consisting of 
0, is also a subspace. A Euclidean plane E 2 through the origin is a subspace of 
the three-dimensional Euclidean space E 3 . A line E x through the origin is a 
subspace of both E t and E 3 . 

The dimension of a subspace is the number of linearly independent vectors 
required to span the subspace. 

Subspaces in Wg 

In the vector space W G (over the Galois field modulo 2) associated with a 
graph G, let us consider the following two types of vectors: A circuit vector is 
a vector in W G representing either a circuit or a union of edge-disjoint circuits 
in graph G. A cut-set vector is a vector in W G representing either a cut-set or a 
union of edge-disjoint cut-sets in G. 
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We know that in the vector space W G the linear combination of two vectors 
(which is simply modulo 2 addition of their components) corresponds to the 
ring sum of the corresponding subgraphs in G. From Theorem 6-2, the ring 
sum of two circuits (or unions of edge-disjoint circuits) is a circuit or a union 
of edge-disjoint circuits. Therefore, the linear combination of two circuit 
vectors is also a circuit vector. Hence 

Theorem 6-4 

The set of all circuit vectors in W G forms a subspace W r . 

Based on parallel arguments and on Theorem 6-3, we have an identical 
result for cut-set vectors. 

Theorem 6-5 

The set of all cut-set vectors in W G forms a subspace IV S . 

Quite naturally, subspaces W v and W s are called the circuit subspace and 
cut-set subspace, respectively. 

Bases of and W r 

After having discovered that a particular set of vectors constitutes a 
subspace, the questions that one asks next are: What is the dimension of this 
subspace? How many vectors does the subspace contain? These questions 
about the subspaces W r and W s are answered by the following important 
results. 

Theorem 6-6 

The set of circuit vectors corresponding to the set of fundamental circuits, 
with respect to any spanning tree, forms a basis for the circuit subspace W v . 

Proof: Consider a spanning tree, T, in a connected graph G, with n — 1 = r 
tree branches and e — n + 1 = p chords. Adding a chord to T produces a 
fundamental circuit, and the corresponding circuit vector can be included in the 
basis of W T . Adding another chord c 2 to subgraph T u produces another 
fundamental circuit, with at least one edge that was not in the previous circuit. 
Therefore, the circuit vector representing the second fundamental circuit and the 
first circuit vector are linearly independent. Thus both these circuit vectors can be 
included in the basis. Adding a third chord to T u c, u c 2 will give another funda¬ 
mental circuit with at least one edge not in either of the previous circuits. Therefore, 
this third circuit vector can also be included in the basis. Continuing with this 
argument, we see that all the p vectors successively obtained this way are linearly 
independent, because each represents a circuit containing at least one edge not 
present in any of the previous ones. Therefore, these p vectors, each corresponding 
to a fundamental circuit, are linearly independent. 
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Now we have to show that every circuit vector is a linear combination of these 
H vectors. 

Consider an arbitrary circuit I", in G, such that 

F i {^1>C2, •••,£’,■, ^m\i 

where edges e t , e 2 ,. . . , e t are chords with respect to T, and e i+ ,, e i+2 , ■ ■ ■ ■> e m are 
branches of T. 

Let g be a subgraph obtained by taking the ring sum of the i fundamental cir¬ 
cuits formed by the chords e u e 2 ,... , and e t . 

Because of Theorem 6-1, subgraph g must be a circuit or a union of edge- 
disjoint circuits. Assrume T } ^ g. Then the subgraph r, 0 g must be either a 
circuit or a union of edge-disjoint circuits. But since both T t and g contain the 
chords e u e 2 ,. . . , and no other chords, the subgraph r, 0 g will not contain 
any chord with respect to T. Hence Fj © #■ has no circuit, a contradiction. So 

r, = g . 

Thus we have shown that any circuit (and by extension a union of edge-disjoint 
circuits) in G can be expressed as a ring sum of some of the fundamental circuits 
with respect to T. The vectors corresponding to a set of fundamental circuits must 
therefore span W r . ■ 

As was brought out in Chapter 5, every set of fundamental circuits con¬ 
stitutes a basis in the circuit subspace W r (i.e., forms a set of basic circuits), 
but every basis in the circuit subspace need not correspond to a set of fun¬ 
damental circuits. (See Problems 5-15 and 6-18.) 

Corollary 

The dimension of the circuit subspace W r is equal to the nullity p of the graph, 
and the number of circuit vectors (including 0) in W r is 2". 

Employing an argument parallel to that used in proving Theorem 6-6, it can be 
shown that the r cut-set vectors, each corresponding to a fundamental cut-set with 
respect to a spanning tree, are linearly independent. 

Also, by a parallel argument it can be proved that any cut-set or a union of 
edge-disjoint cut-sets can be obtained by taking the ring sum of a subset of the r 
fundamental cut-sets with respect to a spanning tree. And thus we get a similar 
result for the cut-set subspace. 

Theorem 6-7 

The set of cut-set vectors corresponding to the set of fundamental cut-sets, 
with respect to any spanning tree, forms a basis for the cut-set subspace tV s . 

Corollary 

The dimension of the cut-set subspace W s is equal to the rank r of the graph, 
and the number of cut-set vectors (including 0) in W s is 2 r . 

Example: Let us now illustrate these results with an example. 
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For the graph G in Fig. 6-5 


number of edges, e — 5, 
rank, r — 3, 
nullity, n — 2. 


The number of vectors in the circuit subspace, therefore, is 2 2 — 4, and these 
are 




1°) 


(') 


l°\ 

1 


10 


[i 


0 

1 

9 

1 

9 

0 

, and 

0 

0 


I 1 


>i 


0 

\o) 


\l/ 


U 


\0 


a basis of W r 


The first two of these vectors correspond to the set of fundamental circuits 
with respect to either of the spanning trees in Fig. 6-5, and therefore they form 
a basis for W r . (In fact, any two of the first three vectors form a basis of W r .) 
The three subgraphs, each corresponding to a nonzero vector in W r , are 
shown in Fig. 6-6. 

The cut-set subspace W s has a dimension of three, and therefore the 
number of vectors in W s is 2 3 = 8. These cut-set vectors are 



The first three vectors correspond to the three fundamental cut-sets with 
respect to the tree g 2 in Fig. 6-5. The rest of the vectors can easily be seen to 
be the vector sums of any two or three of these basis vectors. The seven 




Fig. 6-6 Circuits in graph G of Fig. 6-5. 



sec. 6-8 


ORTHOGONAL VECTORS AND SPACES 


129 



Fig. 6-7 Cut-sets and union of edge-disjoint cut-sets in graph G of 
Fig. 6-5. 


subgraphs, each corresponding to a nonzero cut-set vector, are sketched in 
Fig. 6-7. 

In this example you may have observed that the subgraph {e t , e 2 , e 4 , <? 5 } is 
both a circuit and a union of two edge-disjoint cut-sets. The vector (1, 1,0, 1, 
1) corresponding to this subgraph, therefore, occurs in both subspaces W r 
and W s . 

Another observation you may have also made is that there are at least 
(2 e — 2" — 2 r + 1) nonzero vectors which are neither in W r nor in W s . In 
this example we must have at least 21 (= 2 s — 2 3 — 2 2 + 1) such vectors. 
Since there is one vector common to W r and JV S , we have in fact 22 vectors in 
W G that are neither circuit vectors nor cut-set vectors. 

Having obtained some insight into the circuit subspace and cut-set sub¬ 
space, let us now explore the relationship between these two subspaces. 


6-8. ORTHOGONAL VECTORS AND SPACES 

Consider two vectors (4, 2) and (—3, 6) in a plane (which is also called a 
two-dimensional Euclidean space E 2 ), as shown in Fig. 6-8. These vectors are 
orthogonal because their dot product 4-(—3) + 2*6 = 0. Generalizing this 
notion to a ^-dimensional vector space, we have the following definitions: 

Dot Product: The dot product of two vectors X and Y in a vector space W 
is a scalar quantity defined as 


X-Y = (x„ x 2 , . . . , x k )-(>„ y 2 ,. . . , y k ) 
= x i *Ti + x i‘y 2 + • • • + x k -y k . 
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Fig. 6-8 Pair of orthogonal vectors in 
space Ei. 


Orthogonal Vectors: Two vectors are called orthogonal if their dot product 
is zero; and two subspaces are said to be orthogonal to each other if every 
vector in one is orthogonal to every vector in the other. 

Returning to the vector space associated with a graph G, the dot product 
of two vectors, each representing a subgraph of G, is the modulo 2 sum of the 
products of the corresponding entries in the two vectors. For example, the dot 
product of the vectors representing subgraphs g, and g 2 in Fig. 6-5 is 


(1,0, 1,0, l)-(0, 1, 1, 1,0) = 1-0 + 0-1 + 1-1 + 0-1 

+ 1*0 (mod 2 sum) 
= 0 + 0 + 1 + 0+0 
= 1 . 


The number of nonzero entries in the sum of products above is the number of 
edges common to g { and g 2 . Theorem 6-8 follows directly from the definition 
of the dot product of two vectors. 

Theorem 6-8 

The dot product of two vectors, one representing a subgraph g and the other+, 
is zero if the number of edges common to g and g' is even; the dot product is 1 if 
the number of common edges is odd. 

Theorem 6-9 

In the vector space of a graph, the circuit subspace and the cut-set subspace 
are orthogonal to each other. 

Proof: According to Theorem 4-3, the number of edges common to a circuit 
and a cut-set is even. What about the number of edges common to a union of 
edge-disjoint circuits and a union of edge-disjoint cut-sets? That this is also even 
can be shown as follows: 

Let gj be a union of three edge-disjoint circuits F,, F 2 , and T 3 in a graph G, 
and gi be a union of two edge-disjoint cut-sets 5, and S 2 in G. 

Let the number of edges common to 
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r, and Si be 2a, 
r ] and S 2 be 2b, 
r 2 and S'! be 2c, 

F 2 and S 2 be 2d, 
r 3 and Si be 2e, 

F 3 and S 2 be 2/. 

Since there is no edge common between S x and S 2 , or between Tj and T 2 and 
r 3 , the six sets of common edges enumerated above are all distinct (some may be 
empty). Therefore, the number of edges common to g x and g 2 is 

2a + 2b -\- 2c -1- 2d + 2e + 2/, an even number. 

This example can be extended to g x and g 2 to include the union of any finite 
numbers of edge-disjoint circuits and Cut-sets, respectively. From Theorem 6-8, 
the dot product of a circuit vector and a cut-set vector is zero. Hence every vector 
in each of these subspaces is orthogonal to every vector in the other. Therefore, 
the theorem. ■ 

For instance, the dot product of the cut-set vector (0, 1, 1, 1,0) and the 
circuit vector (1, 1, 1,0, 0) in the example in Section 6-7 (i.e., Fig. 6-5) is 


( 0 , 1 , 1 , 1 , 0 )-( 1 , 1 , 1 , 0 , 0 ) = 0-1 + 1-1 + 1-1 + 1-0 + 0-0 

= 0 (mod 2). 

6-9. INTERSECTION AND JOIN OF W r AND W s 

Given the two subspaces W v and JV S of the vector space W G , it is interest¬ 
ing to ask what is the largest set of vectors that belongs to both circuit 
subspace W r and the cut-set subspace W s ; and what is the smallest set of 
vectors containing both W v and IV S ? Clearly, the null or zero vector 0 is in 
both W T and W s , but there may also be some nonzero vectors contained in 
the intersection W r n W s . For example, the vector 

/>\ 

1 

0 

1 

\il 

for the graph in Fig. 6-5 is in both subspaces. It is not difficult to show that 
the set of vectors W r n W s always forms a vector subspace in W G . 
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On the other hand, the smallest subspace containing both W r and W s 
must contain the set union W r U W s , of course, but (because of the closure 
requirements for a subspace) it will usually contain some additional vectors 
not in W r U W s . For example, for the graph in Fig. 6-5 set W r u W s contains 
10 vectors (union of Figs. 6-6 and 6-7), while the smallest subspace con¬ 
taining set W r U W s , that is, the subspace spanned by the set of vectors in 
W r U W s , consists of 16 vectors. (What are the remaining six subgraphs not 
included in Figs. 6-6 and 6-7?) The subspace spanned by W T u W s is called 
the join of W r and W s , and is written as W r V W s . 

The following is a well-known result from linear algebra: If X and Y are 
two subspaces in a finite-dimensional vector space, then the dimension of 
their join, dim(X V T), is given by 

dim(A" V Y) = dim X + dim Y — dim(Z n Y). 

Using this result, we get 


dim(fF r V W s ) - e ~ dim(fF r n W s ). 

Two subspaces of a vector space are said to be orthogonal complements if the 
subspaces are orthogonal to each other, and they together span the entire 
vector space. Thus we have the following interesting result. 

Theorem 6-10 

Subspaces W T and W s are orthogonal complements if and only if 

dim (W r n W s ) = 0, i.e., W r n W s = 0. 

In other words, a set of basis vectors of W r together with a set of basis 
vectors of fV s form a basis for W G if and only if W r n W s — 0. Consequently, 
any subgraph g of G can be uniquely expressed as a ring sum of two sub¬ 
graphs, one a circuit or an edge-disjoint union of circuits and the other a cut¬ 
set or an edge-disjoint union of cut-sets, if and only if 

W r n W s = 0. 

These properties are illustrated in Fig. 6-9. 

In the case 


W r n W s ^ 0 


we have nonzero vectors each orthogonal to itself. This seemingly peculiar 
situation arises from the finiteness of the field. In fact, the dot product of any 
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w v n w s = 



o 


W V MW S = W G 



The remaining three vectors are uniquely expressed as 



Fig. 6-9 Graph and its different subspaces. 

vector over GF(2) with itself is zero if and only if the vector contains an even 
number of l’s. 

Now, since 

dim (W r n W s ) ^ 0, 


the two subspaces W r and W s are not orthogonal complements. Nor is it 
possible to express every vector in W G as a sum of two vectors, one from W r 
and the other from W s . For example, in Fig. 6-5 no linear combination of 
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vectors in W r and W s will yield the vector 

n\ 

0 

0 . 

0 

W 

In fact, for Fig. 6-5 there are 16 such vectors in W G that are not in W T V W s , 
because 

dim(1V r V W s ) = e - dim(JF r n W s ) = 5 - 1 = 4. 

The reader is encouraged to sketch a figure like Fig. 6-9, using the graph 
in Fig. 6-5. Identify all 32 subgraphs, and place them in subspaces W r , JV S , 

(W s n W r ), and (W s V W r ). For more on properties of these subspaces see 
[6-8] and [6-1]. 

SUMMARY 

In this chapter various algebraic or number systems were introduced, and 
it was shown that to every graph G corresponds a vector space W G over the 
field of integers modulo 2 [i.e., GF(2)]. For a graph G with e edges the 
dimension of W G is e, and the number of vectors in W G is 2 e , each corre¬ 
sponding to a subgraph of G. 

Cut-sets and unions of edge-disjoint cut-sets formed an r-dimensional 
subspace W s in W G . The number of vectors in subspace W s is naturally 2 r , 
each vector corresponding to a cut-set or a union of edge-disjoint cut-sets. 
Similarly, the circuits and union of edge-disjoint circuits correspond to a 
/^-dimensional vector space W r , with 2" vectors. Out of many bases available, 
the set of /* vectors representing all fundamental circuits, with respect to any 
spanning tree, forms a convenient basis in the circuit subspace. Likewise, the 
set of r fundamental cut-sets, with respect to any spanning tree, provides a 
basis in the cut-set subspace. 

The cut-set subspace and circuit subspace of a graph are orthogonal to 
each other. The intersection of these two subspaces is not necessarily [0]; 
that is, there may be nonzero vectors common to cut-set and circuit sub¬ 
spaces. Every one of these vectors in W s D W v is orthogonal to itself, and 
they (including the origin 0) form another vector subspace. The set of 
vectors in the union W s u W r does not necessarily form a vector space. 

It was also shown that W G has, in general, a large number of vectors 
(2 e — 2 M — 2 r + 1 vectors or more) which belong neither to the cut-set 
subspace nor to the circuit subspace. 

On one hand, a graph provides an elegant and concrete example of 
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“spaces” of more than three dimensions, which often appear frighteningly 
mysterious to many nonmathematicians. A graph also provides an example 
of a vector space over a field other than those of usual real or complex 
numbers. On the other hand, a study of the vector space of a graph and the 
nature of different subspaces shows us “what makes a graph tick.” It gives us 
an additional mathematical footing in analysis and applications of graphs, 
such as in coding theory (to be covered in Chapter 12). 

Vectors and matrices are closely related. In the next chapter we will 
explore various matrices associated with a graph, and tie the vector spaces 
and matrices of graphs together. 
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PROBLEMS 

6-1. Show that the usual operation of subtraction does not satisfy the associative axiom. 
6-2. Show that the set of the four fourth roots of unity that is, [1, — 1, /, — /}, satisfies 
all five criteria for being an abelian group under the ordinary multiplication 
operation. 
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6 - 3 . 

6 - 4 . 

6 - 5 . 

6 - 6 . 

6 - 7 . 

6-8. 

6 - 9 . 

6 - 10 . 

6 - 11 . 

6 - 12 . 

6 - 13 . 

6 - 14 . 

6 - 15 . 

6 - 16 . 

6 - 17 . 

6 - 18 . 


6 - 19 . 

6 - 20 . 


Given a set {*, y, z} of three elements, show that there is only one group possible 
with this set. 

From the table in Fig. 6-3(a), show that each element in the set {0,1,2} has a 
unique inverse under modulo 3 addition. What about under multiplication modulo 
3 ? Use the table in Fig. 6-3(b). 

Show that there are only two different groups possible with four elements, and that 
both these groups are abelian. 

Given a set [a, b, c, d } of four elements, construct two four by four tables for 
operations * and O, such that the set is a field. Identify the letters playing the roles 
of identities with respect to * and O (i.e., 0 and 1). 

Write down the addition and multiplication tables for each of modulo 4, 5, 6, and 
7 arithmetics (similar to those in Figs. 6-3 and 6-4). 

From the appropriate table in Problem 6-7, show that not every nonzero element 
(i.e., 1,2, and 3) has a unique inverse under the modulo 4 multiplication operation. 
Show that the modulo 6 system is an abelian ring with unity, but is not a field. 
Prove that in any vector space the null vector 0 is orthogonal to every vector in 
the space. 

Show that W G , as defined in Section 6-5, satisfies all four conditions for being a 
vector space, as stated in Section 6-4. 

In vector space W G , do the vectors associated with the spanning trees of G form 
a vector space over GF(2)? Explain. 

Let C be a graph consisting of a circuit of length four. Depict the four subspaces 
Ws, Wr, Wt n Ws, and Wr V Ws as was done in Fig. 6-9. Draw the corres¬ 
ponding subgraphs. Have all 16 subgraphs of G been accounted for? 

Repeat Problem 6-13 for a complete graph of four vertices. Find a basis for Ws 
and Wr. 

If a graph G is a tree (or a forest), show that the cut-set subspace Ws fills the entire 
vector space W G of graph G. 

Characterize a graph for which the circuit space contains the vector (1,1,..., 1). 
Prove that the number of distinct bases possible in a cut-set subspace is 

-pC2r - 2°)(2 r - 2>)(2 r - 2 2 ). .. (2' - 2'- 1 ), 

where r is the rank of the graph. 

Prove that the number of spanning trees in a connected labeled graph with nullity 
H cannot exceed the number 

—,(2" - 2°)(2^ - 2 1 )(2-“ - 2 2 ) . . . (2m - l «-i). 


(Hint: Associated with each spanning tree there is a distinct basis in subspace Wr, 
corresponding to the set of fundamental circuits. Therefore, there are at least as 
many distinct bases in Wr as the number of different spanning trees.) 

Sketch a graph G that has the following vectors (among others) in its circuit sub¬ 


space: (0, 1, 1, 1, 1,0, 0, 1), (0, 1, 1, 1,0, 1, 1,0), (0, 1,0, 0, 1,0, 1,0), (0,1,0,0,0, 


1,0, 1), (1,0, 1,0, 1, 1,0, 1), (1,0, 1,0, 0,0, 1,0), (1,0,0, 1, 1, 1, 1,0), and (1,0, 


0 , 1 , 0 , 0 , 0 , 1 ). 


Given that a graph is connected and that Wr n Ws ^ 0, investigate further the 
properties of the subgraphs corresponding to the vectors in subspaces (a) Wr n Ws 
and (b) W r V W s . 




MATRIX REPRESENTATION OF GRAPHS 


Although a pictorial representation of a graph is very convenient for a 
visual study, other representations are better for computer processing. A 
matrix is a convenient and useful way of representing a graph to a computer. 
Matrices lend themselves easily to mechanical manipulations. Besides, many 
known results of matrix algebra can be readily applied to study the structural 
properties of graphs from an algebraic point of view. In many applications 
of graph theory, such as in electrical network analysis and operations re¬ 
search, matrices also turn out to be the natural way of expressing the problem. 

In this chapter we shall consider two most frequently used matrix repre¬ 
sentations of a graph. Also a correspondence between some graph-theoretic 
properties and matrix properties will be established. In view of the close tie 
between matrices and vector spaces, this chapter should, in fact, be looked 
upon as a continuation of Chapter 6. A rudimentary knowledge of matrix 
algebra is assumed. 


7-1. INCIDENCE MATRIX 

Let G be a graph with n vertices, e edges, and no self-loops. Define an n 
by e matrix A = [a u ], whose n rows correspond to the n vertices and the e 
columns correspond to the e edges, as follows: 

The matrix element 

a u =1, if y'th edge e } is incident on /th vertex v t , and 
= 0, otherwise. 
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Fig. 7-1 Graph and its incidence matrix. 


Such a matrix A is called the vertex-edge incidence matrix , or simply incidence 
matrix. Matrix A for a graph G is sometimes also written as A(G). A graph 
and its incidence matrix are shown in Fig. 7-1. 

The incidence matrix contains only two elements, 0 and 1. Such a matrix 
is called a binary matrix or a (0, l)-matrix. Let us stipulate that these two 
elements are from Galois field modulo 2.t Given any geometric representa¬ 
tion of a graph without self-loops, we can readily write its incidence matrix. 

•(•Although matrices are customarily defined over a commutative ring with identity, 
which need not be a field (such as the ring of integers), we have defined matrix A over a 
field, GF(2), in keeping with our definition of the vector space Wo in Chapter 6. 
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On the other hand, if we are given an incidence matrix A(G), we can construct 
its geometric graph G without ambiguity. The incidence matrix and the 
geometric graph contain the same information!—they are simply two al¬ 
ternative ways of representing the same (abstract) graph. 

The following observations about the incidence matrix A can readily be 
made: 


1. Since every edge is incident on exactly two vertices, each column of 
A has exactly two l’s. 

2. The number of l’s in each row equals the degree of the corresponding 
vertex. 


3. A row with all 0’s, therefore, represents an isolated vertex. 

4. Parallel edges in a graph produce identical columns in its incidence 
matrix, for example, columns 1 and 2 in Fig. 7-1. 

5. If a graph G is disconnected and consists of two components g, and 
g 2 , the incidence matrix A(G) of graph G can be written in a block- 
diagonal form as 


A(G) = 


rA(g.) 

o ~ 

_ 6 

A(g 2 jJ 


(7-1) 


where A(gJ and A(g 2 ) are the incidence matrices of components g, 
and g 2 . This observation results from the fact that no edge in g, is 
incident on vertices of g 2 , and vice versa. Obviously, this remark is 
also true for a disconnected graph with any number of components. 

6. Permutation of any two rows or columns in an incidence matrix simply 
corresponds to relabeling the vertices and edges of the same graph. 
This observation leads us to Theorem 7-1. 


Theorem 7-1 

Two graphs G x and G 2 are isomorphic if and only if their incidence matrices 
A(G,) and A(G 2 ) differ only by permutations of rows and columns. 

Rank of the Incidence Matrix: Each row in an incidence matrix A(G) may 
be regarded as a vector over GF(2) in the vector space of graph G. Let the 

fJust as in any two alternative methods of representation, some properties are more 
evident in one representation than in the other. For example, the fact that the graph is 
planar is obvious in Fig. 7-1 (a), whereas it is not at all obvious from the matrix in Fig. 
7-1(b). 
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vector in the first row be called A,, in the second row A 2 , and so on. Thus 


A(C) = 


A, 

a 2 


LA 


(7-2) 


Since there are exactly two l’s in every column of A, the sum of all these 
vectors is 0 (this being a modulo 2 sum of the corresponding entries). Thus 
vectors A 1? A 2 , . . . , A„ are not linearly independent. Therefore, the rank of 
A is less than n ; that is, rank A < n — 1. 

Now consider the sum of any m of these n vectors (m < n — 1). If the 
graph is connected, A (G) cannot be partitioned, as in Eq. (7-1), such that 
A(g,) is with m rows and A(g 2 ) with n — m rows. In other words, no m by m 
submatrix of A (G) can be found, for m < n — 1, such that the modulo 2 sum 
of those m rows is equal to zero. 

Since there are only two constants 0 and 1 in this field, the additions of all 
vectors taken m at a time for m = 1,21 exhausts all possible linear 
combinations of n — 1 row vectors. Thus we have just shown that no linear 
combination of m row vectors of A (for m < n — 1) can be equal to zero. 
Therefore, the rank of A(G) must be at least n — 1. 

Since the rank of A(G) is no more than n — 1 and is no less than n — 1, it 
must be exactly equal to n — 1. Hence Theorem 7-2. 


Theorem 7-2 

If A (G) is an incidence matrix of a connected graph G with n vertices, the rank 
of A (G) is n — 1. 

The argument leading to Theorem 7-2 can be extended to prove that the 
rank of A((7) is n — k, if G is a disconnected graph with n vertices and k com¬ 
ponents (Problem 7-3). This is the reason why the number n — k has been 
called the rank of a graph with k components. 

If we remove any one row from the incidence matrix of a connected 
graph, the remaining (n — 1) by e submatrix is of rank n — 1 (Theorem 7-2). 
In other words, the remaining n — 1 row vectors are linearly independent. 
Thus we need only n — 1 rows of an incidence matrix to specify the corre¬ 
sponding graph completely, for n — 1 rows contain the same amount of 
information as the entire matrix. (This is obvious, since given n — 1 rows we 
can easily reconstitute the missing row, because each column in the matrix 
has exactly two 1 ’s.) 
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Such an (n — 1) by e submatrix A f of A is called a reduced incidence 
matrix. The vertex corresponding to the deleted row in A f is called the ref¬ 
erence vertex. Clearly, any vertex of a connected graph can be made the 
reference vertex. 

Since a tree is a connected graph with n vertices and n — 1 edges, its 
reduced incidence matrix is a square matrix of order and rank n — 1. In other 
words, 

Corollary 

The reduced incidence matrix of a tree is nonsingular. 

A graph with n vertices and n — 1 edges that is not a tree is disconnected. 
The rank of the incidence matrix of such a graph will be less than n — 1. 
Therefore, the (n — 1) by (n — 1) reduced incidence matrix of such a graph 
will not be nonsingular. In other words, the reduced incidence matrix of a 
graph is nonsingular if and only if the graph is a tree. 


7-2. SUBMATRICES OF A (G) 

Let g be a subgraph of a graph G, and let A(g) and A(G) be the incidence 
matrices of g and G, respectively. Clearly, A(g) is a submatrix of A(G) (pos¬ 
sibly with rows or columns permuted). In fact, there is a one-to-one corre¬ 
spondence between each n by k submatrix of A(G) and a subgraph of G with 
k edges, k being any positive integer less than e and n being the number of 
vertices in G. 

Submatrices of A(G) corresponding to special types of subgraphs, such 
as circuits, spanning trees, or cut-sets in G, will undoubtedly exhibit special 
properties. Theorem 7-3 gives one such property. 

Theorem 7-3 

Let A(G) be an incidence matrix of a connected graph G with n vertices. An 
(n — 1) by (n — 1) submatrix of A (G) is nonsingular if and only if the n — 1 
edges corresponding to the n — 1 columns of this matrix constitute a spanning tree 
in G. 

Proof: Every square submatrix of order w — 1 in A(G) is the reduced incidence 
matrix of the same subgraph in G with n — 1 edges, and vice versa. From the 
remarks following Theorem 7-2, it is clear that a square submatrix of A(G) is 
nonsingular if and only if the corresponding subgraph is a tree. The tree in this 
case is a spanning tree, because it contains n — 1 edges of the ^-vertex graph. 
Thus the theorem. ■ 
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7-3. CIRCUIT MATRIX 


Let the number of different circuits in a graph G be q and the number of 
edges in G be e. Then a circuit matrix B = [6 l7 ] of G is a q by e, (0, l)-matrix 
defined as follows: 

bij =1, if ith circuit includes y'th edge, and 
= 0, otherwise. 


To emphasize the fact that B is a circuit matrix of graph G , the circuit matrix 
may also be written as B(G). 

The graph in Fig. 7-l(a) has four different circuits, {a, b}, [c, e, g], {d,f, 
g], and fc, d,f, e }. Therefore, its circuit matrix is a 4 by 8, (0, l)-matrix as 
shown: 


B(G) = \ 
4 


a b 
'1 1 
0 0 
0 0 
0 0 


c d e f g h 
0 0 0 0 0 0 ” 
10 10 10 
0 10 110 
11110 0 


(7-3) 


The following observations can be made about a circuit matrix B(G) of 
a graph G: 


1. A column of all zeros corresponds to a noncircuit edge (i.e., an edge 
that does not belong to any circuit). 

2. Each row of B(G) is a circuit vector. 

3. Unlike the incidence matrix, a circuit matrix is capable of representing 
a self-loop—the corresponding row will have a single 1. 

4. The number of l’s in a row is equal to the number of edges in the 
corresponding circuit. 

5. If graph G is separable (or disconnected) and consists of two blocks 
(or components) g, and g 2 , the circuit matrix B((7) can be written in 
a block-diagonal form as 


3(G) 


[B (g.) 

0 ” 

6 

B(g 2 )J 


where B(g]) and B(g 2 ) are the circuit matrices of g, and g 2 . This ob- 
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servation results from the fact that circuits in g, have no edges belong¬ 
ing to g 2 , and vice versa (Problem 4-14). 

6. Permutation of any two rows or columns in a circuit matrix simply 
corresponds to relabeling the circuits and edges. 

7. Two graphs G, and G 2 will have the same circuit matrix if and only if 
G, and G 2 are 2-isomorphic (Theorem 4-15). In other words, (unlike 
an incidence matrix) the circuit matrix does not specify a graph com¬ 
pletely. It only specifies the graph within 2-isomorphism. For instance, 
it can be easily verified that the two graphs in Figs. 4-11 (a) and (d) 
have the same circuit matrix, yet the graphs are not isomorphic. 

An important theorem relating the incidence matrix and the circuit matrix 
of a self-loop-free graph G is 

Theorem 7-4 

Let B and A be, respectively, the circuit matrix and the incidence matrix (of a 
self-loop-free graph) whose columns are arranged using the same order of edges. 
Then every row of B is orthogonal to every row A; that is, 

A-B T = B-A r = 0 (mod 2), (7-4) 

where superscript T denotes the transposed matrix. 

Proof: Consider a vertex v and a circuit T in the graph G. Either v is in F or 
it is not. If v is not in T, there is no edge in the circuit T that is incident on v. On 
the other hand, if v is in T, the number of those edges in the circuit T that are 
incident on v is exactly two. 

With this remark in mind, consider the /th row in A and the yth row in B. 
Since the edges are arranged in the same order, the nonzero entries in the corre¬ 
sponding positions occur only if the particular edge is incident on the /th vertex 
and is also in the yth circuit. 

If the /th vertex is not in the yth circuit, there is no such nonzero entry, and 
the dot product of the two rows is zero. If the /'th vertex is in the yth circuit, there 
will be exactly two l's in the sum of the products of individual entries. Since 
1+1=0 (mod 2), the dot product of the two arbitrary rows—one from A and 
the other from B—is zero. Hence the theorem. ■ 

As an example, let us multiply the incidence matrix and transposed circuit 
of the graph in Fig. 7-1(a), after making sure that the edges are in the same 
order in both. 



144 MATRIX REPRESENTATION OF GRAPHS 


CHAP. 7 


A-B r 


0 0 0 
0 0 0 
0 0 0 
1 1 1 
0 0 1 
1 1 0 

"0 

0 

0 

0 

0 

0 


10 10 
0 111 
0 0 0 0 
0 10 0 
10 0 1 
0 0 0 0 


0 0 0 " 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


O' 

1 

1 

0 

0 

0 


1 

1 

0 

0 

0 

0 

0 

0 


(mod 2). 


0 0 
0 0 
1 0 
0 1 
1 0 
0 1 
1 1 
0 0 


0 " 

0 

1 

1 

1 

1 

0 

0 


7-4. FUNDAMENTAL CIRCUIT MATRIX 
AND RANK OF B 

A set of fundamental circuits (or basic circuits) with respect to any span¬ 
ning tree in a connected graph, as discussed in Chapters 3 and 6, are the only 
independent circuits in a graph. The rest of the circuits can be obtained as 
ring sums (i.e., linear combinations) of these circuits. Thus, in a circuit matrix, 
if we retain only those rows that correspond to a set of fundamental circuits 
and remove all other rows, we would not lose any information. The remaining 
rows can be reconstituted from the rows corresponding to the set of fun¬ 
damental circuits. For example, in the circuit matrix in Eq. (7-3), the fourth 
row is simply the mod 2 sum of the second and third rows. 

A submatrix (of a circuit matrix) in which all rows correspond to a set of 
fundamental circuits is called a fundamental circuit matrix B f . A graph and 
its fundamental circuit matrix with respect to a spanning tree (indicated by 
heavy lines) are shown in Fig. 7-2. 

As in matrices A and B, permutations of rows (and/or of columns) do not 
affect B f . If n is the number of vertices and e the number of edges in a connect¬ 
ed graph, then B / is an (e — n + 1) by e matrix, because the number of 
fundamental circuits is e — n + 1, each fundamental circuit being produced 
by one chord. 

Let us arrange the columns in B f such that all the e — n + 1 chords 
correspond to the first e — n + 1 columns. Furthermore, let us rearrange 
the rows such that the first row corresponds to the fundamental circuit made 
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(a) 


e 2 e 3 e 6 e l e 4 e 5 e 7 


” 1 

0 

0 

! 

1 

1 

1 

1 

1 

0 

1 " 

0 

1 

0 

1 

1 

1 

1 

0 

1 

0 

1 

_0 

0 

1 

1 

1 

1 

0 

0 

1 

1 _ 


(b) 

Fig. 7-2 Graph and its fundamental circuit matrix (with respect 
to the spanning tree shown in heavy lines). 

by the chord in the first column, the second row to the fundamental circuit 
made by the second, and so on. This indeed is how the fundamental circuit 
matrix is arranged in Fig. 7-2(b). 

A matrix B f thus arranged can be written as 

B, = [I, I BJ, (7-5) 

where \ M is an identity matrix of order n = e — n + 1, and B f is the remain¬ 
ing fi by (n — 1) submatrix, corresponding to the branches of the spanning 
tree. 

From Eq. (7-5) it is clear that the 

rank ofB f — n — e — n 1 . 

Since B f is a submatrix of the circuit matrix B, the 

rank of B > e — n + 1. 

In fact, we can prove Theorem 7-5. 

Theorem 7-5 

If B is a circuit matrix of a connected graph G with e edges and n vertices, 

rank of B = e — n + 1. 
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Proof: If A is an incidence matrix of G, from Eq. (7-4) we have 

A-B r =^ 0 (modi). 

Therefore, according to Sylvester’s theorem (Appendix B), 

rank of A + rank of B < e; 

that is, 


Since 
we have 
But 

Therefore, we must have 


rank of B < e — rank of A. 
rank of A = n — 1 
rank of B < <? — « -f 1. 
rank of B > e — n + 1. 


rank of B = e — n + 1. ■ 


An Alternative Proof: Theorem 7-5 can also be proved by considering the 
circuit subspace W v in the vector space W G of a graph, as discussed in Chapter 6. 

Every row in circuit matrix B is a vector in W r , and since the rank of any 
matrix is equal to the number of linearly independent rows (or columns) in the 
matrix, we have. 

rank of matrix B = number of linearly independent rows in B; 

but the number of linearly independent rows in B < number of linearly independent 
vectors in W r , and the number of linearly independent vectors in W v = dimension 
of W v = ju. Therefore, rank of B < e — n + 1. Since we already showed that 
rank of B > e — n -f 1, Theorem 7-5 follows. ■ 

Note that in talking of spanning trees of a graph G it is necessary to as¬ 
sume that G is connected. In the case of a disconnected graph, we would have 
to consider a spanning forest and fundamental circuits with respect to this 
forest. It is not difficult to show (considering component by component) that 
if G is a disconnected graph with k components, e edges, and n vertices, 

rank ofB = fi = e — nf-k. 


7-5. APPLICATION TO A SWITCHING NETWORK 

Suppose you are given a box that contains a switching network consisting 
of eight switches a , b, c, d , e,f, g , and h. The switches can be turned on or off 
from outside. You are asked to determine how the switches are connected 
inside the box, without opening the box, of course. 

One way to find the answer is to connect a lamp at the available terminals 
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The box 


Fig. 7-3 Black box with a switching network. 

in series with a battery and an additional switch k , as shown in Fig. 7-3. And 
then find out which of the various combinations light up the lamp. 

In this experiment, suppose you discover that the combinations that turn 
on the lamp are eight: 

( a,b,f,h,k ), ( a,b,g,k ), (a, e,f, g, k), (a, e, h, k), 

( b,c,e,h,k ), ( c,f,h,k ), (c, g, k), ( d, k ). 

Solution: Consider the switching network as a graph whose edges repre¬ 
sent switches. We can assume that the graph is connected, and has no self¬ 
loop. Since a lit lamp implies the formation of a circuit, we can regard the 
preceding list as a partial list of circuits in the corresponding graph. With 
this list we form a circuit matrix: 

abode f g h k 

1 r i ioooioi r 

2 110 0 0 0 10 1 

3 1 0 0 0 1110 1 

4 100010011 

5 0 1 10 10 0 1 1 

6 0 0 1 0 0 1 0 1 1 

7 0 0 1 0 0 0 10 1 

8[_0 00 10000 1_ 

Next, to simplify the matrix, we should remove the obviously redundant 
circuits. Observe that the following ring sums of circuits give rise to other 
circuits: 

(a, b , g, k) © (c,/, h , k) © (c, g, k) = ( a , A,/, A, k), 

(a, b, g, k ) © (a, e, h, k) © ( c , g, k) = ( b , c, e, h, k), 

(a, e , A, k) © (c,/, A, k) © (c, g, k) = (a , <?,/, g, k). 
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Therefore, we can delete the first, third, and fifth rows from matrix B, without 
any loss of information. Remaining is a 5 by 9 matrix B,: 


a 

b 

c 

d 

e 

/ 

8 

h 

k 


1 

1 

0 

0 

0 

0 

1 

0 

r 



1 

0 

0 

0 

1 

0 

0 

1 

1 


B, = 

0 

0 

1 

0 

0 

1 

0 

1 

1 

• 


0 

0 

1 

0 

0 

0 

1 

0 

1 



^0 

0 

0 

1 

0 

0 

0 

0 

1_ 



Our next goal is to bring matrix B, to the form of Eq. (7-5). For this we 
interchange columns to get B 2 : 


b 2 


b e f 
"1 0 0 
0 1 0 
0 0 1 
0 0 0 
0 0 0 


8 

1 

0 

0 

1 

0 


d 

0 

0 

0 

0 

1 


a c h k 

i o o r 
10 11 
0 111 
0 10 1 
0 0 0 1 


Adding the fourth row in B 2 to the first, we get B 3 . 


b 

e 

/ 

8 

d 

a 

c 

h 

k 

- 1 

0 

0 

0 

0 

1 

1 

0 

0^ 

0 

1 

0 

0 

0 

1 

0 

1 

1 

0 

0 

1 

0 

0 

0 

1 

1 

1 

0 

0 

0 

1 

0 

0 

1 

0 

1 

_0 

0 

0 

0 

1 

0 

0 

0 

1 


We note that there are no redundant circuits in matrix B 3 , and B 3 is a 
fundamental circuit matrix of the required graph. Since the rank of B 3 is 
five, and the network was assumed to be connected, we have the following 
information about the graph: 


number of edges e = 9, 
nullity n = 5, 
rank r = 4, 
number of vertices n = 5. 
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Constructing a graph from its incidence matrix is simple, but constructing 
a graph from its fundamental circuit matrix is difficult. We shall, therefore, 
construct an incidence matrix from B 3 . 

Since the rows in the incidence matrix are orthogonal to those in B 3 — 
according to Eq. (7-4)—we must first look for a 4 by 9 matrix M, whose rows 
are linearly independent and are orthogonal to those of B 3 . 

Since, 


B. 


[I 5 ! F], 


an orthogonal matrix to B 3 is 


M = [-F|U 
= [F^IIJ, 

because in mod 2 arithmetic —1 — 1, [i.e., in GF(2) the additive inverse of 
1 is 1]. 

Thus 


b 

e 

/ 

g 

d 

a 

c 

h 

k 

"1 

1 

0 

0 

0 

1 

0 

0 

0 " 

1 

0 

1 

1 

0 

0 

1 

0 

0 

0 

1 

1 

0 

0 

0 

0 

1 

0 

_0 

1 

1 

1 

1 

0 

0 

0 

1 _ 


Clearly, the rank of M is four, and it is easy to check that 

B 3 -M r = 0. 

Before M can be regarded as a reduced incidence matrix, it must have at 
most two l’s in each column. This can be achieved by adding (mod 2) the 


third row to the fourth in M, which gives us M'. 




b 

e 

/ g 

d 

a 

c 

h 

k 

n 

1 

0 0 

0 

1 

0 

0 

or 

M' = 

1 

0 

1 1 

0 

0 

1 

0 

0 

0 

1 

1 0 

0 

0 

0 

1 

0 


_0 

0 

0 1 

1 

0 

0 

1 

i 


Matrix M' is the reduced incidence matrix. The incidence matrix A can 
be obtained by adding a fifth row to M' such that there are exactly two l’s 
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in every column; that is, 

befgdachk 

ri loooiooo' 

10 1 10 0 10 0 

A = 0 1 1 0 0 0 0 1 0- 

0 0 0 110 0 11 
0 0 0 0 1 1 1 0 1 

From the incidence matrix A we can readily construct the graph and hence 
the corresponding switching network, as shown in Fig. 7-4. 



(b) 


Fig. 7-4 Graph and the corresponding switching network. 
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7-6. CUT-SET MATRIX 

Analogous to a circuit matrix, we can define a cut-set matrix C = [c i} ] in 
which the rows correspond to the cut-sets and the columns to the edges of the 
graph, as follows: 

c u — 1, if /th cut-set contains yth edge, and 
= 0, otherwise. 

For example, a graph and its cut-set matrix are shown in Fig. 7-5. 

The following remarks may be made about a cut-set matrix C(G) of a 
graph G. 

1. As in the case of the incidence matrix, a permutation of rows or 
columns in a cut-set matrix corresponds simply to a renaming of the 
cut-sets and edges, respectively. 

2. Each row in C(G) is a cut-set vector. 

3. A column with all 0’s corresponds to an edge forming a self-loop. 

4. Parallel edges produce identical columns in the cut-set matrix (e.g., 
first two columns in Fig. 7-5). 

5. In a nonseparable graph, every set of edges incident on a vertex is a 
cut-set (Problem 4-8). Therefore, every row of incidence matrix A(G) 
is included as a row in the cut-set matrix C(G). In other words, for a 
nonseparable graph G, C(G) contains A(G). For a separable graph, the 
incidence matrix of each block is contained in the cut-set matrix. For 
example, the incidence matrix of the block {c, d, e,f, g] in Fig. 7-5 is 
the 4 by 5 submatrix of C left after deleting rows a, b, and h and col¬ 
umns 1, 2, 5, and 8. 

6. In view of observation 5, 

rank of C(G) > rank of A(G). 

Hence, for a connected graph of n vertices, 

rank of C(G) > n — 1. (7-6) 

7. Since the number of edges common to a cut-set and a circuit is always 
even, every row in C is orthogonal to every row in B, provided the 
edges in both B and C are arranged in the same order. In other words, 


B*C r = C* B r = 0 (mod 2). 


(7-7) 
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a b c d e f g 

1 To o o o o o o 

2 1 1 0 0 0 0 0 

3 0 0 10 10 0 

4 0 0 0 0 1 1 1 

c = 

5 0 0 10 0 11 

6 0 0 0 10 10 

7 0 0 1 10 0 1 

8 0 0 0 1 10 1 

Fig. 7-5 Graph and its cut-set matrix. 
On applying Sylvester’s theorem to Eq. (7-7), 
rank of B + rank of C < e, 
and since for a connected graph 



rank of B = e — n + 1, 
rank of C < n — 1. 


(7-8) 
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RELATIONSHIPS AMONG Ay, By, AND Cy 

Combining Eqs. (7-6) and (7-8), 

rank of C = n — 1 . 

Thus we have the following important theorem for a connected graph G. 
Theorem 7-6 

The rank of cut-set matrix C(G) is equal to the rank of the incidence matrix 
A(C7), which equals the rank of graph G. 

As in the case of the circuit matrix, the cut-set matrix generally has many 
redundant (or linearly dependent) rows. Therefore, it is convenient to define 
a fundamental cut-set matrix, Cy, as follows: 

A fundamental cut-set matrix Cy (of a connected graph G with e edges 
and n vertices) is an (n — 1) by e submatrix of C such that the rows corre¬ 
spond to the set of fundamental cut-sets with respect to some spanning tree. 

As in the case of a fundamental circuit matrix, a fundamental cut-set 
matrix C f can also be partitioned into two submatrices, one of which is an 
identity matrix l„_, of order n — 1. That is, 

Cy = [C c ! (7-9) 

where the last n — 1 columns forming the identity matrix correspond to the 
n — 1 branches of the spanning tree, and the first e — n + 1 columns form¬ 
ing C c correspond to the chords. 

A connected graph and a fundamental cut-set matrix with respect to a 
spanning tree (shown in heavy lines) are given in Fig. 7-6. 

Again note that in talking of cut-set matrices we have confined ourselves 
to connected graphs only. This treatment can be generalized to include dis¬ 
connected graphs by considering one component at a time. 

7-7. RELATIONSHIPS AMONG A /( B /( AND Cy 

In this section we shall explore the relationships among the reduced in¬ 
cidence matrix Ay, the fundamental circuit matrix By, and the fundamental 
cut-set matrix C f of a connected graph. 

It has been shown that 

By = [I, i BJ, (7-5) 

Cy = [C e j !._>], (7-9) 

where subscript t denotes the submatrix corresponding to the branches of a 
spanning tree, and subscript c denotes the submatrix corresponding to the 
chords. 

Let the spanning tree T in Eqs. (7-5) and (7-9) be the same, and let the 
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Fig. 7-6 Spanning tree in a graph and the corresponding funda¬ 
mental cut-set matrix. 


order of the edges in both equations be the same. Furthermore, in the reduced 
incidence matrix A f —of size (n — 1) by e —let the edges (i.e., the columns) 
be arranged in the same order as in B^- and C f . Partition A f into two sub¬ 
matrices: 

A, = [A c !AJ, (7-10) 

where A, consists of the n — 1 columns corresponding to the branches of the 
spanning tree T, and A c is the remaining submatrix corresponding to the 
e — n + 1 chords. 

Since the columns in A f and B / are arranged in the same order, from Eq. 
(7-4) we have (in mod 2 arithmetic) 


A r &} = 0 . 

That is, [A c i A,]- -gr - = °> 

A c + A f -Bf = 0. 


and 


(7-11) 
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Since A, is nonsingular, its inverse A f _1 exists. Premultiplying both sides 
of Eq. (7-11) by A f “ 1 , we get 


Since in mod 2 arithmetic 


Ar‘-A c = —Bf. 

1 = 1 , 

Bf = A~‘ *A C . 


(7-12) 


(7-13) 


Similarly, since the columns in B^. and C f are arranged in the same order, 
according to Eq. (7-4), we have (in mod 2 arithmetic) 


That is, 


[CJ !„-,]■ 


C/* B/ 

K 

L b r_, 


0 . 

0 , 


Bf 


- B J 
= A, 1 -A, 


(7-14) 

(7-15) 




from (7-13). 


For example, let us look at the following three matrices for the graph used 
in Figs. 7-1, 7-5, and 7-6. Using {a, e,f, g, h} as the spanning tree, and drop¬ 
ping the sixth row from matrix A in Fig. 7-1 to get A f , we have 


B, 


b 

c 

d 

a 

e 

/ 

g 

h 

0 

0 

1 

0 

0 

1 

0 

(T 


0 

0 

0 

0 

1 

1 

1 

1 


0 

0 

0 

0 

0 

0 

0 

1 

— [A 

1 

1 

0 

1 

1 

0 

0 

0 


_0 

1 

1 

0 

0 

0 

1 

0^ 


b 

c 

d 

a 

e 

/ 

g 

h 

- 1 

0 

0 

1 

0 

0 

0 

0' 


0 

1 

0 

0 

1 

0 

1 

0 

= [l 3 

0 

0 

1 

0 

0 

1 

1 

0 



b 

c 

d 

a 

e 

/ 

g 

h 

- 1 

0 

0 

1 

0 

0 

0 

(T 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

1 

0 

0 

0 

1 

0 

_0 

0 

0 

0 

0 

0 

0 

1 


= [CJI,]. 
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B f = C c is immediate. It can also be readily verified that 

Ar*-A e = B r. 

This leads to three conclusions: 

1. Given A or A f , we can readily construct B 7 and C f , starting from an 
arbitrary spanning tree and its subgraph A, in A f . 

2. Given either B / or C / , we can construct the other. Thus since B^ de¬ 
termines a graph within 2-isomorphism, so does C^-. 

3. Given either B / or C f , A f in general cannot be determined completely. 

7-8. PATH MATRIX 

Another (0, l)-matrix often convenient to use in communication and 
transportation networks is the path matrix. A path matrix is defined for a 
specific pair of vertices in a graph, say (x, y ), and is written as P(x, y). The 
rows in P(jc, y) correspond to different paths between vertices x and y, and 
the columns correspond to the edges in G. That is, the path matrix for (x, y) 
vertices is P(x, y) = where 

Pu = 1, if j'th edge lies in /th path, and 
= 0, otherwise. 

As an illustration, consider all paths between vertices v 3 and v 4 in Fig. 7-1(a). 
There are three different paths; {h, e), {h, g, c], and {h,f, d, c}. Let us number 
them 1, 2, and 3, respectively. Then we get the 3 by 8 path matrix P(v 3 ,v 4 ): 

a b c d e f g h 

i ro o o o i o o r 

P(v 3 , v A ) = 2 0 0 1 0 0 0 1 1 • 

3 Lo 0 1 10 10 1 

Some of the observations one can make at once about a path matrix 
P(x, .y) of a graph G are 

1. A column of all 0’s corresponds to an edge that does not lie in any 
path between x and y. 

2. A column of all l’s corresponds to an edge that lies in every path 
between x and y. 

3. There is no row with all 0’s. 

4. The ring sum of any two rows in P(x, corresponds to a circuit or an 
edge-disjoint union of circuits. 
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Theorem 7-7 

If the edges of a connected graph are arranged in the same order for the columns 
of the incidence matrix A and the path matrix P(a, y ), then the product (mod 2) 

A• P T (x, >>) = M, 

where the matrix M has l’s in two rows .v and y, and the rest of the n — 2 rows 
are all 0’s. 

Proof: The proof is left as an exercise for the reader (Problem 7-14). 


As an example, multiply the incidence matrix in Fig. 7-1 to the transposed 
P(v 3 , vf), just discussed. 


A-P r (v 3 , w 4 ) 


"0 0 01010 0 
0 0 0 0 1 1 I 1 

0 0 0 0 0 0 0 1 

1 1 10 10 0 0 

0 0 1 10 0 10 
1 10 0 0 0 0 0 


"0 0 0 " 
0 0 0 
0 l 1 
0 0 1 
1 0 0 
0 0 1 
0 1 0 
1 1 1 


V, 

V 2 

V 3 

V 4 

v 5 


v 6 


1 

"0 

0 

1 

1 

0 

0 


2 

0 

0 

1 

1 

0 

0 


3 

0" 

0 

1 

1 

0 

0 


(mod 2). 


Other properties of the path matrix, such as the rank, are left for the 
reader to investigate on his own. It should be noted that a path matrix con¬ 
tains less information about the graph in general than any of the matrices 
A, B, or C does. 


7-9. ADJACENCY MATRIX 

As an alternative to the incidence matrix, it is sometimes more convenient 
to represent a graph by its adjacency matrix or connection matrix. The ad¬ 
jacency matrix of a graph G with n vertices and no parallel edges is an n by n 
symmetric binary matrix X = [x iV ] defined over the ring of integers such that 

Xu = 1, if there is an edge between ith and y'th vertices, and 

= 0, if there is no edge between them. 
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0 10 0 11 

10 0 1 10 

0 0 0 10 0 
0 110 11 
1 10 10 0 

10 0 10 0 

Fig. 7-7 Simple graph and its adjacency matrix. 

A simple graph and its adjacency matrix are shown in Fig. 7-7. 
Observations that can be made immediately about the adjacency matrix 
X of a graph G are 

1. The entries along the principal diagonal of X are all 0’s if and only if 
the graph has no self-loops. A self-loop at the ith vertex corresponds to 

*it = 1 • 

2. The definition of adjacency matrix makes no provision for parallel 
edges. This is why the adjacency matrix X was defined for graphs 
without parallel edges.t 



tSome authors (see Busacker and Saaty [1-2], page 109, for example) define xu as 
equal to the number of edges incident on both vertices / and j, and thus take into account 
parallel edges. 
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3. If the graph has no self-loops (and no parallel edges, of course), the 
degree of a vertex equals the number of l’s in the corresponding row 
or column of X. 

4. Permutations of rows and of the corresponding columns imply reorder¬ 
ing the vertices. It must be noted, however, that the rows and columns 
must be arranged in the same order. Thus, if two rows are interchanged 
in X, the corresponding columns must also be interchanged. Hence two 
graphs G, and G 2 with no parallel edges are isomorphic if and only if 
their adjacency matrices X(Gj) and X(G 2 ) are related: 

X(G 2 ) = R-'-X(G,).R, 
where R is a permutation matrix. 

5. A graph G is disconnected and is in two components and g 2 if and 
only if its adjacency matrix X(G) can be partitioned as 


X(G) 


*(£,)• 

_____ 


X(g 2 ). 


where X(g,) is the adjacency matrix of the component g, and X(g 2 ) is 
that of the component g 2 . 

This partitioning clearly implies that there exists no edge joining 
any vertex in subgraph g x to any vertex in subgraph g 2 . 

6. Given any square, symmetric, binary matrix Q of order n, one can 
always construct a graph G of n vertices (and no parallel edges) such 
that Q is the adjacency matrix of G. 


Powers of X: Let us multiply by itself the 6 by 6 adjacency matrix of the 
simple graph in Fig. 7-7. The result, another 6 by 6 symmetric matrix X 2 , is 
shown below (note that this is ordinary matrix multiplication in the ring of 
integers and not mod 2 multiplication): 


3 10 3 10“ 
13 112 2 

X2== ° 1 1 ° 1 1 
3 10 4 10 

12 113 2 

_0 2 1 0 2 2 _ 


The value of an off-diagonal entry in X 2 , that is, ijth entry (z ^ j) in X 2 , 
=-- number of l’s in the dot product of zth row and /th column (or jth 
row) of X. 

= number of positions in which both zth andy'th rows of X have l’s. 

— number of vertices that are adjacent to both zth and jth vertices. 

-= number of different paths of length two between zth andyth vertices 
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Similarly, the ith diagonal entry in X 2 is the number of l’s in the ith row 
(or column) of matrix X. Thus the value of each diagonal entry in X 2 equals 
the degree of the corresponding vertex, if the graph has no self-loops. 

Since a matrix commutes with matrices that are its own power, 

X-X 2 - X 2 -X = X 3 . 

And since the product of two square symmetric matrices that commute is 
also a symmetric matrix, X 3 is a symmetric matrix. (Again note that this is 
an ordinary product and not mod 2.) 

The matrix X 3 for the graph of Fig. 7-7 is 

_ 2 7 3 2 7 6~ 

7 4 1 8 5 2 

X 3 = 3 1 0 4 I 0 

2 8 4 2 8 7 

7 5 1 8 4 2 

_6 2 0 7 2 0_ 

Let us now consider the ijth entry of X 3 . 

ijth entry of X 3 = dot product of ith row X 2 and yth column (or row) of 

X. 

= X ikth entry of X 2 • Ay'th entry of X. 

k= 1 
n 

— X number of all different edge sequencesf of three 

k = \ 

edges from ith to yth vertex via Arth vertex. 

= number of different edge sequences of three edges 
between ith and yth vertices. 

For example, consider how the 1,5th entry on X 3 for the graph of Fig. 
7-7 is formed. It is given by the dot product 

row 1 of X 2 * row 5 ofX = (3, 1,0, 3, 1,0) - (1, 1,0, 1,0,0) 

= 3+14-0 + 3 + 0 + 0 = 7. 

These seven different edge sequences of three edges between v x and v 5 are 

1> +> ^2}’ 1^2’ ^2}’ {^6’ ^61 ^2)’ {^2» ^3]’ 

[e 6 ,e 7 , e 5 j, [e 2 ,e 5 ,e 5 ], {e,,e 4 ,e 5 }. 

fAn edge sequence is a sequence of edges in which each edge (except, of course, the 
first and the last) has one vertex in common with the edge preceding it and one vertex 
in common with the edge following it. A path, a circuit, and a walk are all special examples 
of an edge sequence. An edge may appear more than once in an edge sequence. 
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Clearly this list includes all the paths of length three between v x and v 5 , that is, 
[c 6 , e 7 , c 5 ] and (c,, e 4 , e 5 }. 

It is left as an exercise for the reader to show (Problem 7-19) that the /7th 
entry in X 3 equals twice the number of different circuits of length three (i.e., 
triangles) in the graph passing through the corresponding vertex v r 

The general result that includes the properties of X, X 2 , and X 3 discussed 
so far is expressed in Theorem 7-8. 

Theorem 7-8 

Let X be the adjacency matrix of a simple graph G. Then the /yth entry in X r 
is the number of different edge sequences of r edges between vertices r f and v } . 

Proof: The theorem holds for r - 1, and it has been proved for r — 2 and 3 also. 
It can be proved for any positive integer r, by induction. 

In other words, assume that it holds for r — 1, and then evaluate the /yth entry 
in X, with the help of the relation 


X' _= X r ~ 1 • X, 

as was done for X 3 . 

The rest of the proof is left as an exercise (Problem 7-17). 

Corollary A 

In a connected graph, the distance between two vertices v t and vj (for i ^ j) 
is k, if and only if k is the smallest integer for which the /, yth entry in x k is nonzero. 

This is a useful result in determining the distances between different pairs 
of vertices. 

Corollary B 

If X is the adjacency matrix of a graph G with n vertices, and 

Y = X + X 2 + X 3 + • • • + X n_I , (in the ring of integers), 

then G is disconnected if and only if there exists at least one entry in matrix Y 
that is zero. 

Relationship Between A (G) and X(G): Recall that if a graph G has no 
self-loops, its incidence matrix A (G) contains all the information about G. 
Likewise, if G has no parallel edges, its adjacency matrix X(G) contains all the 
information about G. Therefore, if a graph G has neither self-loops nor parallel 
edges (i.e., G is a simple graph), both A (G) and X(G) contain the entire in¬ 
formation. Thus it is natural to expect that either matrix can be obtained 
directly from the other, in the case of a simple graph. This relationship is 
given in Problem 7-23. 
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SUMMARY 

The theory of matrices has been brought to bear upon the theory of 
graphs. The use of matrices in studying graphs has been amply demonstrated 
in this chapter. 

We have seen that there are several matrices which can be associated with 
graphs. Two of these, the incidence matrix A and the adjacency matrix X, 
describe a simple graph completely, that is, up to isomorphism. Two others, 
the circuit matrix B and the cut-set matrix C, display some important features 
of the graph and describe the graph only within 2-isomorphism. The path 
matrix P(x, y) contains even less information than B or C does. 

To see further into the structure of the graph, we investigated these 
matrices, pulled out submatrices A f , B /5 C f , 1^, l n _,, B„ B c , C { , and C c , and 
studied them and their interrelationships. 

The properties brought out in this chapter do not by any means exhaust 
the list. Many interesting and useful results are contained in the problems 
of this chapter. 

The converse problem of finding a graph to represent a given matrix has 
been touched upon lightly in Section 7-5. The problem of realizability, that is, 
what conditions must a given matrix B satisfy so that a graph can be found 
whose circuit matrix is B, is very useful and interesting. We shall encounter 
this problem of realizability again in Chapter 12. 
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PROBLEMS 

7-1. Write the incidence matrices for the labeled simple graphs shown in Figs. 1-12 and 
4-1 (b). Put the incidence matrix of the graph of Fig. 4-1 (b) in the block-diagonal 
form of Eq. (7-1). 
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7-2. Consider the graph in Fig. 4-3. With respect to the spanning tree [b, c, e, h, k), 
write matrices Ay, By, and Cy in the forms of Eqs. (7-10), (7-5), and (7-9), respec¬ 
tively. Verify by actual computation Eqs. (7-13) and (7-15). 

7-3. Show that for a simple disconnected graph of k components, n vertices, and e edges 
the ranks of matrices A, B, and C are n — k, e — n + k, and n — k, respectively. 

7-4. Label the edges of the graph in Fig. 4-8, and write down its circuit matrix B. 
Verify observations 1-5 made in Section 7-3 about the properties of matrix B. 

7-5. Draw two nonisomorphic, connected, simple, and nonseparable graphs G i and Gz, 
with as small a number of edges as you can, such that the circuit matrices B((7i) 
= B(C 2 ). {Hint: G i and Gz are 2-isomorphic, and must be 2-connected.) 

7-6. A black box containing a switching network of seven switches—1,2, 3, 4, 5, 6, and 
7—was subjected to the experiment shown in Fig. 7-3. The lamp was lit when each 
of the following combinations of switches was turned on, in addition to the external 
switch k, of course: (1,4, 5), (1,4, 6, 7), (2, 5, 7), (2, 6), (3, 5), and (3, 6, 7). Show 
the switching network configuration. 

7-7. In Section 7-5 a graph was obtained corresponding to a given fundamental circuit 
matrix. Similarly, sketch a procedure for obtaining a graph if its fundamental cut-set 
matrix Cy is given. Can you get two different (nonisomorphic) graphs for the same 
Cy? If yes, how are these different graphs related? 

7-8. Show that you can determine a graph within 2-isomorphism if you were given the 
set of all spanning trees. {Hint: From the set of all spanning trees every cut-set 

can be determined, using Theorem 4-2. And the set of all cut-sets determines a 

graph within 2-isomorphism.) 

7-9. If the following is the list of all spanning trees of a graph G, determine G. 

[a,e,d,e], { a,c,d,f j, {b,c,d,e}, {b,c,d,f}, 

[a, c, e,f}, {b, c, e,f], {a,d,e,f}, {b,d,e,f}, 

{a, b, d, e }, {a, b, d,f}, {a, b, e,f). 

7-10. Express the relationship of dualism between two planar, simple graphs in terms of 
appropriate matrices. 

7-11. Characterize simple, self-dual graphs in terms of their circuit and cut-set matrices. 

7-12. Prove that 

B/ = [U!A^A f - |T ], 

Cy — A “ 1 • Ay. 

7-13. Write down the path matrix P(vj, v 6 ) for the graph in Fig. 4-3. Verify observations 
1-4 in Section 7-8 and Theorem 7-7. 

7-14. Prove Theorem 7-7. 

7-15. Characterize Ay, By, Cy, and X matrices of the complete graph of n vertices. 

7-16. After having labeled the graph in Fig. 4-8 (as required in Problem 7-4), write its 
adjacency matrix X. How does the fact that the graph is separable reflect in X? 
Characterize the adjacency matrix X of a separable graph, in general. 

7-17. Complete the proof of Theorem 7-8. 

7-18. The diameter of a connected graph is defined (Chapter 3) as the largest distance 
between two vertices in the graph. Given the adjacency matrix X, how will you 
determine the diameter of the corresponding graph? {Hint: Consider a sum of the 
powers of X.) 

7-19. Show that each diagonal entry in X 3 equals twice the number of triangles passing 
through the corresponding vertex. 
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7-20. Prove that the number of spanning trees in a connected graph equals the value of 

det(A/-Aj), 

where A/ is the reduced incidence matrix of the graph, and the arithmetic operations 
are carried out in the real field and not mod 2. 

7-21. Similar to the circuit or cut-set matrix, define a spanning-tree matrix for a connected 
graph, and observe some of its properties. 

7-22. Let C be the cut-set matrix of a nonseparable graph G, and let C(x, >>) be the 
submatrix of C, containing only those rows of C that represent cut-sets with respect 
to vertices x and y. Show that C(x, y) contains a fundamental cut-set matrix C/ of 
G. 

7-23. For a labeled graph G of n vertices, define an n by n diagonal matrix D (called the 
degree matrix of G ) such that the /th diagonal entry in D equals the degree of the 
/th vertex in G. Define another matrix E, obtained from the incidence matrix A 
of G by arbitrarily replacing one of the two l’s in every column by a —1. Show 
that if G is a simple, connected graph the following holds (the computations are in 
the ring of integers and not mod 2): 

(a) E-E 7 = D - X. 

(b) All cofactors of the matrix D — X are equal. 

(c) Each cofactor of D — X equals the number of spanning trees in G, where X is 
as usual the adjacency matrix of G. 

7-24. Use the result obtained in Problem 7-23(c) to prove Cayley’s formula (Theorem 
3-10). 

7-25. Let x and y be a pair of vertices in a simple nonseparable graph G, and P(x, y) 
be the corresponding path matrix of G. Prove that every circuit in G is obtained 
as a mod 2 sum of two rows of P(x, j). From this result, prove that a path matrix 
in a simple, nonseparable graph determines the graph within 2-isomorphism. 
[Hint: Every circuit T in G falls in one of three categories: (1) T passes through 
both x and y ; (2) T passes through neither x nor y; or (3) T passes through either 
x or y. Consider all three cases, and use Theorem 4-11.] 

7-26. Prove that for a connected, self-loop-free graph G, subspaces Wr and Ws are 
orthogonal complements of Wg over GF(2) if and only if the number of spanning 

trees in G is odd. [Hint: Define a new e by e matrix M = Compute det(MM r ), 

using the identity C/B£ = B/C£ = 0, and the Binet-Cauchy theorem (see Appendix 
A). Show that det(MM r ) = 1 (mod 2) if and only if G has an odd number of 
spanning trees.] 



8 COLORING, COVERING, AND 
PARTITIONING 


Suppose that you are given a graph G with n vertices and are asked to 
paint its vertices such that no two adjacent vertices have the same color. What 
is the minimum number of colors that you would require? This constitutes 
a coloring problem. Having painted the vertices, you can group them into 
different sets—one set consisting of all red vertices, another of blue, and so 
forth. This is a partitioning problem. The coloring and partitioning can, of 
course, be performed on edges or vertices of a graph. In the case of a planar 
graph, one may even be interested in coloring the regions. These are the types 
of questions to be considered in this chapter. 

Earlier we came across the subject of partitioning the edges of a given 
graph into sets with some specified properties. For example, finding a span¬ 
ning tree in a connected graph is equivalent to partitioning the edges into two- 
sets—one set consisting of the edges included in the spanning tree, and the 
other consisting of the remaining edges. Identification of a Hamiltonian cir¬ 
cuit (if it exists) is another partitioning of the set of edges in a given graph. 

The coloring and partitioning of vertices (or edges) is not performed out 
of mere playfulness, as it may appear from this introduction. Partitioning is 
applicable to many practical problems, such as coding theory, partitioning 
of logic in digital computers, and state reduction of sequential machines. 


8-1. CHROMATIC NUMBER 

Painting all the vertices of a graph with colors such that no two adjacent 
vertices have the same color is called the proper coloring (or sometimes simply 
coloring) of a graph. A graph in which every vertex has been assigned a color 


165 



166 


COLORING, COVERING, AND PARTITIONING 


CHAP. 8 



(a) (b) (c) 

Fig. 8-1 Proper colorings of a graph. 

according to a proper coloring is called a properly colored graph. Usually a 
given graph can be properly colored in many different ways. Figure 8-1 shows 
three different proper colorings of a graph. 

The proper coloring which is of interest to us is one that requires the 
minimum number of colors. A graph G that requires k different colors for its 
proper coloring, and no less, is called a K-chromatic graph, and the number 
k is called the chromatic number of G. You can verify that the graph in Fig. 
8-1 is 3-chromatic. 

In coloring graphs there is no point in considering disconnected graphs. 
How we color vertices in one component of a disconnected graph has no 
effect on the coloring of the other components. Therefore, it is usual to in¬ 
vestigate coloring of connected graphs only. All parallel edges between two 
vertices can be replaced by a single edge without affecting adjacency of 
vertices. Self-loops must be disregarded. Thus for coloring problems we need 
to consider only simple, connected graphs. 

Some observations that follow directly from the definitions just introduced 
are 

1. A graph consisting of only isolated vertices is I -chromatic. 

2. A graph with one or more edges (not a self-loop, of course) is at least 
2-chromatic (also called bichromatic). 

3. A complete graph of n vertices is ^-chromatic, as all its vertices are 
adjacent. Hence a graph containing a complete graph of r vertices is 
at least r-chromatic. For instance, every graph having a triangle is at 
least 3-chromatic. 

4. A graph consisting of simply one circuit with n > 3 vertices is 2- 
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chromatic if n is even and 3-chromatic if n is odd. (This can be seen by 
numbering vertices 1,2,, ... ,, n in sequence and assigning one color to 
odd vertices and another to even. If n is even, no adjacent vertices will 
have the same color. If n is odd, the «th and first vertex will be adjacent 
and will have the same color, thus requiring a third color for proper 
coloring.) 

Proper coloring of a given graph is simple enough, but a proper coloring 
with the minimum number of colors is, in general, a difficult task. In fact, 
there has not yet been found a simple way of characterizing a /c-chromatic 
graph. (The brute-force method of using all possible combinations can, of 
course, always be applied, as in any combinatorial problem. But brute force 
is highly unsatisfactory, because it gets out of hand as soon as the size of the 
graph increases beyond a few vertices.) Chromatic numbers of some specific 
types of graphs will be discussed in the rest of this section. 

Theorem 8-1 

Every tree with two or more vertices is 2-chromatic. 

Proof: Select any vertex v in the given tree T. Consider T as a rooted tree at 
vertex v. Paint v with color 1. Paint all vertices adjacent to v with color 2. Next, 
paint the vertices adjacent to these (those that just have been colored with 2) 
using color 1. Continue this process till every vertex in T has been painted. (See 
Fig. 8-2). Now in T we find that all vertices at odd distances from v have color 2, 
while v and vertices at even distances from v have color 1. 

Now along any path in T the vertices are of alternating colors. Since there is 
one and only one path between any two vertices in a tree, no two adjacent vertices 
have the same color. Thus Thas been properly colored with two colors. One color 
would not have been enough (observation 2 in this section). ■ 

Though a tree is 2-chromatic, not every 2-chromatic graph is a tree. (The 
utilities graph, for instance, is not a tree.) What then is the characterization 

l 


Fig. 8-2 Proper coloring of a tree. 
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of a 2-chromatic graph? Theorem 8-2 (due to Konig) characterizes all 2- 
chromatic graphs. 

Theorem 8-2 

A graph with at least one edge is 2-chromatic if and only if it has no circuits 
of odd length. 

Proof: Let G be a connected graph with circuits of only even lengths. Consider 
a spanning tree T in G. Using the coloring procedure and the result of Theorem 
8-1, let us properly color T with two colors. Now add the chords to T one by one. 
Since G had no circuits of odd length, the end vertices of every chord being replaced 
are differently colored in T. Thus G is colored with two colors, with no adjacent 
vertices having the same color. That is, G is 2-chromatic. 

Conversely, if G has a circuit of odd length, we would need at least three colors 
just for that circuit (observation 4 in this section). Thus the theorem. ■ 

An upper limit on the chromatic number of a graph is given by Theorem 
8-3, whose proof is left as an exercise (Problem 8-1). 

Theorem 8-3 

If <f m ix is the maximum degree of the vertices in a graph G, 
chromatic number of G < 1 T */ max . 

Brooks [8-1] showed that this upper bound can be improved by 1 if G has no 
complete graph of r/ max + 1 vertices. In that case 

chromatic number of G < r/ max . 

A graph G is called bipartite if its vertex set V can be decomposed into 
two disjoint subsets K, and V 2 such that every edge in G joins a vertex in V, 
with a vertex in V 2 . Thus every tree is a bipartite graph. So are the graphs in 
Figs. 8-6 and 8-8. Obviously, a bipartite graph can have no self-loop. A set of 
parallel edges between a pair of vertices can all be replaced with one edge 
without affecting bipartiteness of a graph. 

Clearly, every 2-chromatic graph is bipartite because the coloring parti¬ 
tions the vertex set into two subsets V , and V 2 such that no two vertices in 
V j (or V 2 ) are adjacent. Similarly, every bipartite graph is 2-chromatic, with 
one trivial exception; a graph of two or more isolated vertices and with no 
edges is bipartite but is 1-chromatic. 

In generalizing this concept, a graph G is called p-partite if its vertex set 
can be decomposed into p disjoint subsets V n V 2 ,... , V p , such that no edge 
in G joins the vertices in the same subset. Clearly, a K-chromatic graph is 
/^-partite if and only if 


k < p. 
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With this qualification, the results of this section on /c-chromatic graphs 
are applicable to /c-partite graphs also. 


8-2. CHROMATIC PARTITIONING 

A proper coloring of a graph naturally induces a partitioning of the ver¬ 
tices into different subsets. For example, the coloring in Fig. 8-1 (c) produces 
the partitioning 

b„v 4 }» i v 2 }» and [v 3 ,v 5 ). 

No two vertices in any of these three subsets are adjacent. Such a subset 
of vertices is called an independent set; more formally: 

A set of vertices in a graph is said to be an independent set of vertices or 
simply an independent set (or an internally stable set) if no two vertices in the 
set are adjacent. For example,' in Fig. 8-3, [a, c, d] is an independent set. A 
single vertex in any graph constitutes an independent set. 

A maximal independent set (or maximal internally stable set) is an inde¬ 
pendent set to which no other vertex can be added without destroying its 
independence property. The set ( a , c, d,f } in Fig. 8-3 is a maximal indepen¬ 
dent set. The set [b,f] is another maximal independent set. The set { b, g] is a 
third one. From the preceding example, it is clear that a graph, in general, 
has many maximal independent sets; and they may be of different sizes. 
Among all maximal independent sets, one with the largest number of vertices 
is often of particular interest. 

Suppose that the graph in Fig. 8-3 describes the following problem. Each 
of the seven vertices of the graph is a possible code word to be used in some 
communication. Some words are so close (say, in sound) to others that they 
might be confused for each other. Pairs of such words that may be mistaken 
for one another are joined by edges. Find a largest set of code words for a 
reliable communication. This is a problem of finding a maximal independent 
set with largest number of vertices. In this simple example, {a, c, d,f] is an 
answer. 



Fig. 8-3 
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The number of vertices in the largest independent set of a graph G is called 
the independence number (or coefficient of internal stability ), fi(G). 

Consider a /c-chromatic graph G of n vertices properly colored with k 
different colors. Since the largest number of vertices in G with the same color 
cannot exceed the independence number fi(G), we have the inequality 


P(G) > • 

/v 

Finding a Maximal Independent Set: A reasonable method of finding a 
maximal independent set in a graph G will be to start with any vertex v of G 
in the set. Add more vertices to the set, selecting at each stage a vertex that 
is not adjacent to any of those already selected. This procedure will ultimately 
produce a maximal independent set. This set, however, is not necessarily a 
maximal independent set with.a largest number of vertices. 

Finding All Maximal Independent Sets: A reasonable (but not very effici¬ 
ent for large graphs) method for obtaining all maximal independent sets in 
any graph can be developed using Boolean arithmetic on the vertices. Let 
each vertex in the graph be treated as a Boolean variable. Let the logical (or 
Boolean) sum a + b denote the operation of including vertex a or b or both; 
let the logical multiplication ab denote the operation of including both ver¬ 
tices a and b , and let the Boolean complement a denote that vertex a is not 
included. 

For a given graph G we must find a maximal subset of vertices that does 
not include the two end vertices of any edge in G. Let us express an edge (x, j>) 
as a Boolean product, xy, of its end vertices x and y, and let us sum all such 
products in G to get a Boolean expression 

(p — X x y for all (x, y) in G. 

Let us further take the Boolean complement rp' of this expression, and express 
it as a sum of Boolean products: 

<p' = f\ + •*•+/*• 

A vertex set is a maximal independent set if and only if (p = 0 (logically false), 
which is possible if and only if q>' — 1 (true), which is possible if and only if at 
least one f. = 1, which is possible if and only if each vertex appearing in f 
(in complemented form) is excluded from the vertex set of G. Thus each f. 
will yield a maximal independent set, and every maximal independent set will 
be produced by this method. This procedure can be best explained by an 
example. For the graph G in Fig. 8-3, 
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(p — ab + be + bd + be + ce + de + ef + eg -f- fg, 
q>' = (a + b’)(b’ + c'){b' + d'W + e')(c' + e')(d' + e) 
(e + fW + g')(f + g'). 


Multiplying these out and employing the usual identities of Boolean arith¬ 
metic, such as 


aa = a, 
a + a — a, 
a + ab = a , 


we get 


<p' = b'e'f b'e'g' f ac'd’e'f + a'c'd'e'g' + b'c'd'f'g. 


Now if we exclude from the vertex set of G vertices appearing in any one of 
these five terms, we get a maximal independent set. The five maximal inde¬ 
pendent sets are 

aedf, aedg, bg, bf, and ae. 


These are all the maximal independent sets of the graph. 

Finding Independence and Chromatic Numbers: Once all the maximal 
independent sets of G have been obtained, we find the size of the one with 
the largest number of vertices to get the independence number fl(G). The 
independence number of the graph in Fig. 8-3 is four. 

To find the chromatic number of G, we must find the minimum number 
of these (maximal independent) sets, which collectively include all the ver¬ 
tices of G. For the graph in Fig. 8-3, sets {a, c,d,f}, {b,g}, and {a,e}, for 
example, satisfy this condition. Thus the graph is 3-chromatic. 

Chromatic Partitioning: Given a simple, connected graph G, partition all 
vertices of G into the smallest possible number of disjoint, independent sets. 
This problem, known as the chromatic partitioning of graphs, is perhaps the 
most important problem in partitioning of graphs. 

By enumerating all maximal independent sets and then selecting the 
smallest number of sets that include all vertices of the graph, we just solved 
this problem. The following four are some chromatic partitions of the graph 
in Fig. 8-3, for example. 


{(«, c, </,/), {b, g), (c)), 
{{a, c, d, g), ( b,f ), (<?)}, 
{(c, d,f), (b, g), (a, e)j, 
{(c, d, g), ( b,f ), (a, e)}. 
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Fig. 8-4 A 3-chromatic graph. 

This method of chromatic partitioning (requiring enumeration of all 
maximal independent sets) is inefficient and needs prohibitively large 
amounts of computer memory. A more efficient method for computer imple¬ 
mentation is proposed in [8-6]. 

Uniquely Colorable Graphs: A graph that has only one chromatic partition 
is called a uniquely colorable graph. The graph in Fig. 8-3 is not a uniquely 
colorable graph, but the one in Fig. 8-4 is (Problem 8-2). For some interesting 
properties of uniquely colorable graphs, the reader is referred to Chapter 12 
of [1-5]. 

A concept related to that of the independent set and chromatic partition¬ 
ing is the dominating set, to be discussed next. 

Dominating Sets: A dominating set (or an externally stable set) in a graph 
G is a set of vertices that dominates every vertex v in G in the following sense: 
Either v is included in the dominating set or is adjacent to one or more ver¬ 
tices included in the dominating set. For instance, the vertex set {b, g] is a 
dominating set in Fig. 8-3. So is the set {a,b,c,d,f} a dominating set. A 
dominating set need not be independent. For example, the set of all its ver¬ 
tices is trivially a dominating set in every graph. 

In many applications one is interested in finding minimal dominating sets 
defined as follows: 

A minimal dominating set is a dominating set from which no vertex can be 
removed without destroying its dominance property. For example, in Fig. 
8-3, {b, e} is a minimal dominating set. And so is {a,c,d,f}. Observations 
that follow from these definitions are 

1. Any one vertex in a complete graph constitutes a minimal dominating 
set. 

2. Every dominating set contains at least one minimal dominating set. 

3. A graph may have many minimal dominating sets, and of different 
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sizes. [The number of vertices in the smallest minimal dominating set 
of a graph G is called the domination number , a((7).] 

4. A minimal dominating set may or may not be independent. 

5. Every maximal independent set is a dominating set. For if an inde¬ 
pendent set does not dominate the graph, there is at least one vertex 
that is neither in the set nor adjacent to any vertex in the set. Such a 
vertex can be added to the independent set without destroying its in¬ 
dependence. But then the independent set could not have been max¬ 
imal. 

6. An independent set has the dominance property only if it is a maximal 
independent set. Thus an independent dominating set is the same as a 
maximal independent set. 

7. In any graph G, 

a (G) < 0(G). 

Finding Minimal Dominating Sets: A method for obtaining all minimal 
dominating sets in a graph will now be developed. The method, like the one 
for finding all maximal independent sets, also uses Boolean arithmetic. 

To dominate a vertex v ( we must either include v t or any of the vertices 
adjacent to v r A minimum set satisfying this condition for every vertex v ( is 
a desired set. Therefore, for every vertex v i in G let us form a Boolean product 
of sums ( v . + v h -j- v h T- • • • + v ia ), where v. t , v h , . . . , v u are the vertices 
adjacent to v., and d is the degree of v. : 

9 = n ( v i + *>„ + v n d-b v u) for all v, in G. 

When 9 is expressed as a sum of products, each term in it will represent a 
minimal dominating set. Let us illustrate this algorithm using the graph of 
Fig. 8-3: 

Consider the following expression 9 for Fig. 8-3: 

9 = {a + b){b + c + d + e + a)(c + b + e)(d + b + e) 

(e + b + c T d + / + g)(f + e + g)(g + e + /). 

Since in Boolean arithmetic (x 4- y)x = x, 

9 = (a + b)(b + c + e)(b + d + e){e +/ + g) 

= ae + be + bf + bg -f- acdf + acdg. 

Each of the six terms in the preceding expression represents a minimal 
dominating set. Clearly, a (G) — 2, for this example. 
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8-3. CHROMATIC POLYNOMIAL 


In general, a given graph G of n vertices can be properly colored in many 
different ways using a sufficiently large number of colors. This property of a 
graph is expressed elegantly by means of a polynomial. This polynomial is 
called the chromatic polynomial of G and is defined as follows: 

The value of the chromatic polynomial P„(A) of a graph with n vertices 
gives the number of ways of properly coloring the graph, using X or fewer 
colors. 

Let c ; be the different ways of properly coloring G using exactly i different 
colors. Since i colors can be chosen out of X colors in 


different ways, 

there are j different ways of properly coloring G using exactly / colors 
out of X colors. 

Since i can be any positive integer from 1 to n (it is not possible to use more 
than n colors on n vertices), the chromatic polynomial is a sum of these terms; 
that is, 

w -1 i •) 



A , A(A — ]) , „ A(A 

"'71 + 2 ! + 




DU - 2) , 

3! 

n + 1) 


Each <?,. has to be evaluated individually for the given graph. For example, 
any graph with even one edge requires at least two colors for proper coloring, 
and therefore 

c, = 0. 


A graph with n vertices and using n different colors can be properly colored 
in n\ ways; that is, 

= n\. 


As an illustration, let us find the chromatic polynomial of the graph given 
in Fig. 8-4. 

. , X(X - 1) , X(X - \)(X - 2) 

P 5 (X) = c t X + c 2 -^—- + c 3 v A 


+ C A 


2 3 3! 

X(X - 1)(A - 2)(X - 3) „ X(X - l)(X - 2){X - 3)(A - 4) 

A I 1^5 CI 


4! 


5! 
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Since the graph in Fig. 8-4 has a triangle, it will require at least three 
different colors for proper coloring. Therefore, 

c, = c 2 — 0 and c 5 = 5!. 

Moreover, to evaluate c 3 , suppose that we have three colors x, y, and z. 
These three colors can be assigned properly to vertices v x , v 2 , and v 3 in 3! = 6 
different ways. Having done that, we have no more choices left, because 
vertex v 5 must have the same color as v 3 , and v A must have the same color as 
v 2 . Therefore, 

c 3 = 6. 

Similarly, with four colors, v x , v 2 ,and v 3 can be properly colored in 4-6 = 
24 different ways. The fourth color can be assigned to v 4 or v 5 , thus providing 
two choices. The fifth vertex provides no additional choice. Therefore, 

c 4 = 24*2 = 48. 

Substituting these coefficients in P 5 {X), we get, for the graph in Fig. 8-4, 

P 5 (X) = A(A - 1)(A - 2) + 2X(X - 1)(A - 2)(X - 3) 

+ X(X - \)(X - 2)(X - 3)(A - 4) 

= A(A - 1)(A - 2)(A 2 - 5A + 7). 

The presence of factors X — 1 and X — 2 indicates that G is at least 3- 
chromatic. 

Chromatic polynomials have been studied in great detail in the literature. 
The interested reader is referred to [8-5] for a more thorough discussion of 
their properties. Theorems 8-4, 8-5, and 8-6 should provide a glimpse into the 
colorful world of chromatic polynomials. 

Theorem 8-4 

A graph of n vertices is a complete graph if and only if its chromatic polynomial 
is 

P n (X) = A(A - l)(A - 2)... (A - n + 1). 

Proof: With X colors, there are X different ways of coloring any selected vertex 
of a graph. A second vertex can be colored properly in exactly X — 1 ways, the 
third in X — 2 ways, the fourth in X — 3 ways,. . . , and the nth in X — n + 1 
ways if and only if every vertex is adjacent to every other. That is, if and only if 
the graph is complete. ■ 

Theorem 8-5 

An n-vertex graph is a tree if and only if its chromatic polynomial 


P,0) = A(A - !)■-'. 
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Proof: That the theorem holds for n = 1, 2 is immediately evident. It is left 
as an exercise to prove the theorem by induction (Problem 8-9). 

Theorem 8-6 

Let a and b be two nonadjacent vertices in a graph C. Let G' be a graph obtained 
by adding an edge between a and b. Let G" be a simple graph obtained from G 
by fusing the vertices a and b together and replacing sets of parallel edges with 
single edges. Then 


P n {X) of G = P n {X) of G' + P n .fk) of G". 

Proof: The number of ways of properly coloring G can be grouped into two 
cases, one such that vertices a and b are of the same color and the other such that 
a and b are of different colors. Since the number of ways of properly coloring G 
such that a and b have different colors = number of ways of properly coloring G', 
and 

number of ways of properly coloring G such that a and b have the same color 



P 5 (A) of (7 = A(A - 1) (A - 2) + 2A(A- 1) (A - 2) (A - 3) 

+ A(A- 1) (A — 2) (A — 3) (A — 4) 
= A (A - 1) (A - 2) (A 2 - 5A + 7) 

Fig. 8-5 Evaluation of a chromatic polynomial. 
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= number of ways of properly coloring G", 

P„U) of G = P n (X ) of G' + P„_ ,(A) of G". M 

Theorem 8-6 is often used in evaluating the chromatic polynomial of a 
graph. For example, Fig. 8-5 illustrates how the chromatic polynomial of a 
graph G is expressed as a sum of the chromatic polynomials of four complete 
graphs. The pair of nonadjacent vertices shown enclosed in circles is the one 
used for reduction at that stage. 

In the last three sections we have been concerned with proper coloring of 
the vertices in a graph. Suppose that we are interested in coloring the edges 
rather than the vertices. It is reasonable to call two edges adjacent if they have 
one end vertex in common (but are not parallel). A proper coloring of edges 
then requires that adjacent edges should be of different colors. Some results 
on proper coloring of edges, similar to the results given in Sections 8-1 and 
8-2, can be derived (Problem 8-19). 

Moreover, a set of edges in which no two are adjacent is similar to an 
independent set of vertices. Such a set of edges is called a matching , the sub¬ 
ject of the next section. 


8-4. MATCHINGS 

Suppose that four applicants a v a 2 , a 3 , and a 4 are available to fill six 
vacant positions p x ,p 1 -,p i , p^, p 5 , and p 6 . Applicant a, is qualified to fill posi¬ 
tion p 2 or p 5 . Applicant a 2 can fill p 2 or p 3 . Applicant a 3 is qualified for p x ,p 2 , 
p 3 ,Pt, or p 6 . Applicant a 4 can fill jobs p 2 or p 5 . This situation is represented by 
the graph in Fig. 8-6. The vacant positions and applicants are represented by 
vertices. The edges represent the qualifications of each applicant for filling 



Applicants 


Positions 


Fig. 8-6 Bipartite graph. 
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Fig. 8-7 Graph and two of its maximal matchings. 

different positions. The graph clearly is bipartite, the vertices falling into two 
sets F! = {a„ a 2 , a 3 , a 4 j and V 2 = {/?,, p 2 , p 3 , p A , p 5 , p 6 }. 

The questions one is most likely to ask in this situation are: Is it possible 
to hire all the applicants and assign each a position for which he is suitable? 
If the answer is no, what is the maximum number of positions that can be 
filled from the given set of applicants? 

This is a problem of matching (or assignment) of one set of vertices into 
another. More formally, a matching in a graph is a subset of edges in which 
no two edges are adjacent. A single edge in a graph is obviously a matching. 

A maximal matching is a matching to which no edge in the graph can be 
added. For example, in a complete graph of three vertices (i.e., a triangle) 
any single edge is a maximal matching. The edges shown by heavy lines in 
Fig. 8-7 are two maximal matchings. Clearly, a graph may have many differ¬ 
ent maximal matchings, and of different sizes. Among these, the maximal 
matchings with the largest number of edges are called the largest maximal 
matchings. In Fig. 8-7(b), a largest maximal matching is shown in heavy lines. 
The number of edges in a largest maximal matching is called the matching 
number of the graph. 

Although matching is defined for any graph, it is mostly studied in the 
context of bipartite graphs, as suggested by the introduction to this section. 
In a bipartite graph having a vertex partition V , and V 2 , a complete matching 
of vertices in set F, into those in V 2 is a matching in which there is one edge 
incident with every vertex in F,. In other words, every vertex in V, is matched 
against some vertex in V 2 . Clearly, a complete matching (if it exists) is a larg¬ 
est maximal matching, whereas the converse is not necessarily true. 

For the existence of a complete matching of set F, into set F 2 , first we 
must have at least as many vertices in V 2 as there are in F,. In other words, 
there must be at least as many vacant positions as the number of applicants if 
all the applicants are to be hired. This condition, however, is not sufficient. 
For example, in Fig. 8-6, although there are six positions and four applicants, 
a complete matching does not exist. Of the three applicants a x , a 2 , and a 4 , 
each qualifies for the same two positions p 2 and p $ , and therefore one of the 
three applicants cannot be matched. 

This leads us to another necessary condition for a complete matching: 
Every subset of r vertices in F, must collectively be adjacent to at least r 
vertices in F 2 , for all values of r = 1, 2,..., | V, |. This condition is not 
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satisfied in Fig. 8-6. The subset (a,, a 2 , a 4 ] of three vertices has only two ver¬ 
tices p 2 and p 5 adjacent to them. That this condition is also sufficient for 
existence of a complete matching is indeed surprising. Theorem 8-7 is a for¬ 
mal statement and proof of this result. 

Theorem 8-7 

A complete matching of F, into V 2 in a bipartite graph exists if and only if 
every subset of r vertices in V, is collectively adjacent to r or more vertices in V 2 
for all values of r. 

Proof: The “only if” part (i.e., the necessity of a subset of r applicants collec¬ 
tively qualifying for at least r jobs) is immediate and has already been pointed out. 
The sufficiency (i.e., the “if” part) can be proved by induction on r, as the theorem 
trivially holds for r -- 1. For a complete proof, the student is referred to Theorem 
11-1 in [8-3], Theorem 5-19 in [4-5], or Chapter 4 in [1-9]. 

Let us illustrate this important theorem with an example. 

Problem of Distinct Representatives: Five senators a,, s 2 , s 3 , a 4 , and s 5 are 
members of three committees, c { , c 2 , and c 3 . The membership is shown in 
Fig. 8-8. One member from each committee is to be represented in a super¬ 
committee. Is it possible to send one distinct representative from each of the 
committeest ? 

This problem is one of finding a complete matching of a set F, into set 
V 2 in a bipartite graph. Let us use Theorem 8-7 and check if r vertices from 
V, are collectively adjacent to at least r vertices from V 2 , for all values of r. 
The result is shown in Table 8-1 (ignore the last column for the time being). 

Thus for this example the condition for the existence of a complete match¬ 
ing is satisfied as stated in Theorem 8-7. Hence it is possible to form the super¬ 
committee with one distinct representative from each committee. 

The problem of distinct representatives just solved was a small one. A 



Committees Senators 

Fig. 8-8 Membership of committees. 

'[This problem, known as the problem of distinct representatives , was first formulated 
and studied by the English mathematician, Philip Hall, in 1935. 
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V\ 

Vi 

r - q 

r = 1 

{c,} 

{s\,s 2 } 

-1 


[C 2 ) 

(s 1 9 *^3» ^4-) 

-2 


(C3l 

{^3> ^4» ^ 5 } 

-2 

r = 2 

ki, c 2 ] 

{•^ 1 > ^ 2? 5 3 , *£ 4 ] 

-2 


&2, C 3 } 

{ 51 , 53 , 54 , 55 } 

-2 


[c 3 1 Cl} 

{s 1 , S 2 , S 3 , 54 , ^ 5 } 

-3 

r = 3 

{C\,C 2 , C3 } 

(s*!, ■S' 2 > $ 3» *S4j ^ 5 } 

-2 


Table 8-1 


larger problem would have become unwieldy. If there are M vertices in V v 
Theorem 8-7 requires that we take all 2 M — 1 nonempty subsets of V , and 
find the number of vertices of V 2 adjacent collectively to each of these. In 
most cases, however, the following simplified version of Theorem 8-7 will 
suffice for detection of a complete matching in any large graph. 

Theorem 8-8 

In a bipartite graph a complete matching of F, into V 2 exists if (but not only 
if) there is a positive integer m for which the following condition is satisfied: 

degree of every vertex in V, > m > degree of every vertex in V 2 . 

Proof: Consider a subset of r vertices in V t . These r vertices have at least 
m-r edges incident on them. Each m-r edge is incident to some vertex in V 2 . 
Since the degree of every vertex in set V 2 is no greater than m, these m-r edges 
are incident on at least (m-r)/in = r vertices in V 2 . 

Thus any subset of r vertices in V, is collectively adjacent to r or more vertices 
in V 2 . Therefore, according to Theorem 8-7, there exists a complete matching of 

into V 2 . ■ 

In the bipartite graph of Fig. 8-8, 

degree of every vertex in V, > 2 > degree of every vertex in V 2 . 

Therefore, there exists a complete matching. 

In the bipartite graph of Fig. 8-6 no such number is found, because the 
degree of p 2 — 4 > degree of a,. 

It must be emphasized that the condition of Theorem 8-8 is a sufficient 
condition and not necessary for the existence of a complete matching. It will 
be instructive for the reader to sketch a bipartite graph that does not satisfy 
Theorem 8-8 and yet has a complete matching (Problem 8-15). 

The matching problem or the problem of distinct representatives is also 
called the marriage problem (whose solution, unfortunately, is of little use to 
those with real marital problems!) See Problem 8-16. 
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If one fails to find a complete matching, he is most likely to be interested 
in finding a maximal matching, that is, to pair off as many vertices of F, with 
those in V 2 as possible. For this purpose, let us define a new term called 
deficiency, 8(G), of a bipartite graph G. 

A set of r vertices in F, is collectively incident on, say, q vertices of V 2 . 
Then the maximum value of the number r — q taken over all values of r — 1, 
2,. . . and all subsets of F, is called the deficiency 8(G) of the bipartite graph 
G. 

Theorem 8-7, expressed in terms of the deficiency, states that a complete 
matching in a bipartite graph G exists if and only if 

8(G) < 0. 

For example, the deficiency of the bipartite graph in Fig. 8-7 is — 1 (the 
largest number in the last column of Table 8-1). It is suggested that you 
prepare a table for the graph of Fig. 8-6, similar to Table 8-1, and verify that 
the deficiency is + 1 for this graph (Problem 8-17). 

Theorem 8-9 gives the size of the maximal matching for a bipartite graph 
with a positive deficiency. 

Theorem 8-9 

The maximal number of vertices in set V x that can be matched into V 2 is equal to 

number of vertices in F, — 8(G). 

The proof of Theorem 8-9 can be found in [8-3], page 288. The size of a 
maximal matching in Fig. 8-6, using Theorem 8-9, is obtained as follows: 

number of vertices in F, — 8(G) = 4 — 1 =3. 

Matching and Adjacency Matrix: Consider a bipartite graph G with non- 
adjacent sets of vertices F, and V 2 , having number of vertices n x and n 2 , 
respectively, and let /?, < n 2 , n x + n 2 = n, the number of vertices in G. The 
adjacency matrix X(G) of G can be written in the form 


X(G) 



where the submatrix X 12 is the n x by n 2 , (0, l)-matrix containing the informa¬ 
tion as to which of the n x vertices of F, are connected to which of the n 2 ver¬ 
tices of F 2 . Matrix X^ 2 is the transpose of X 12 . 

Clearly, all the information about the bipartite graph G is contained in 
its X, 2 matrix. 

A matching F, into V 2 corresponds to a selection of the l’s in the matrix 
X, 2 such that no line (i.e., a row or a column) has more than one 1. 
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The matching is complete if the n x by n 2 matrix made of selected l’s has 
exactly one 1 in every row. For example, the X 12 matrix for Fig. 8-8 is 
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s 5 |. 




A complete matching of V 1 into V 2 is given by 




S 2 S 3 5 4 S$ 




M =c 2 




0 10 0 0 
1 0 0 0 0 
0 0 0 0 1 


A maximal matching corresponds to the selection of a largest possible 
number of l’s from X 12 such that no row in it has more than one 1. Therefore, 
according to Theorem 8-9, in matrix X 12 the largest number of l’s, no two 
of which are in one row, is equal to 


number of vertices in V, — S(G). 

Matching problems in bipartite graphs can also be formulated in terms of 
the flow problem (see Section 14-5). All edges are assumed to be of unit 
capacity, and the problem of finding a maximal matching is reduced to the 
problem of maximizing flow from the source to the sink (also see [8-3]). 


8-5. COVERINGS 

In a graph G, a set g of edges is said to cover G if every vertex in G is in¬ 
cident on at least one edge in g. A set of edges that covers a graph G is said to 
be an edge covering , a covering subgraph, or simply a covering of G. For 
example, a graph G is trivially its own covering. A spanning tree in a con¬ 
nected graph (or a spanning forest in an unconnected graph) is another 
covering. A Hamiltonian circuit (if it exists) in a graph is also a covering. 

Just any covering is too general to be of much interest. We have already 
dealt with some coverings with specific properties, such as spanning trees and 
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(a) 


(b) 


Fig. 8-9 Graph and two of its minimal coverings. 

Hamiltonian circuits. In this section we shall investigate the minimal cover¬ 
ing —a covering from which no edge can be removed without destroying its 
ability to cover the graph. In Fig. 8-9 a graph and two of its minimal cover¬ 
ings are shown in heavy lines. 

The following observations should be made: 

1. A covering exists for a graph if and only if the graph has no isolated 
vertex. 

2. A covering of an ^-vertex graph will have at least \n/2] edges, (fx] de¬ 
notes the smallest integer not less than x.) 

3. Every pendant edge in a graph is included in every covering of the 
graph. 

4. Every covering contains a minimal covering. 

5. If we denote the remaining edges of a graph by ( G — g), the set of 
edges g is a covering if and only if, for every vertex v, the degree of 
vertex in (G — g) < (degree of vertex v in G) — 1. 

6. No minimal covering can contain a circuit, for we can always remove 
an edge from a circuit without leaving any of the vertices in the circuit 
uncovered. Therefore, a minimal covering of an w-vertex graph can 
contain no more than n — 1 edges. 

7. A graph, in general, has many minimal coverings, and they may be of 
different sizes (i.e., consisting of different numbers of edges). The 
number of edges in a minimal covering of the smallest size is called the 
covering number of the graph. 

Theorem 8-10 

A covering g of a graph is minimal if and only if g contains no paths of length 
three or more. 
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Fig. 8-10 Star graphs of one, two, three, and four edges. 

Proof: Suppose that a covering g contains a path of length three, and it is 


v\e x v 2 e 2 v 3 e 3 v A . 


Edge e 2 can be removed without leaving its end vertices v z and v 3 uncovered. 
Therefore, g is not a minimal covering. 

Conversely, if a covering g contains no path of length three or more, all its 
components must be star graphs (i.e., graphs in the shape of stars; see Fig. 8-10). 
From a star graph no edge can be removed without leaving a vertex uncovered. 
That is, g must be a minimal covering. ■ 

Suppose that the graph in Fig. 8-9 represents the street map of a part of a 
city. Each of the vertices is a potential trouble spot and must be kept under 
the surveillance of a patrol car. How will you assign a minimum number of 
patrol cars to keep every vertex covered ? 

The answer is a smallest minimal covering. The covering shown in Fig. 
8-9(a) is an answer, and it requires six patrol cars. Clearly, since there are 11 
vertices and no edge can cover more than two, less than six edges cannot cover 
the graph. 

Minimization of Switching Functions t: An important step in the logical 
design of a digital machine is to minimize Boolean functions before imple¬ 
menting them. Suppose we are interested in building a logical circuit that 
gives the following function Fof four Boolean variables w, x, y, and z. 

F — wxyz 4 - wxyz + wxyz + wxyz + wxyz -f- wxyz + wxyz, 

where + denotes logical OR, xy denotes x AND y, and x denotes NOT x. 

Let us represent each of the seven terms in F by a vertex, and join every 
pair of vertices that differ only in one variable. Such a graph is shown in Fig. 
8-11. 

An edge between two vertices represents a term with three variables. 

A minimal cover of this graph will represent a simplified form of F, per¬ 
forming the same function as F, but with less logic hardware. 

The pendant edges 1 and 7 must be included in every covering of the 

fThose not familiar with switching functions may skip this subsection. 
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graph. Therefore, the terms 

xyz and xyz are essential. 

Two additional edges 3 and 6 (or 4 and 5 or 3 and 5) will cover the remainder. 
Thus a simplified version of F is 


F — xyz + xyz + wyz + wyz. 

This expression can again be represented by a graph of four vertices, as shown 
in Fig. 8-12. 

The essential terms xyz and xyz cannot be covered by any edge, and hence 
cannot be minimized further. One edge will cover the remaining two vertices 
in Fig. 8-12. Thus the minimized Boolean expression is 

F — xyz -f- xyz + wy. 

Dimer Problem: In crystal physics, a crystal is represented by a three- 
dimensional lattice. Each vertex in the lattice represents an atom, and an 
edge between vertices represents the bond between the two atoms. In the 
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study of the surface properties of crystals, one is interested in two-dimen¬ 
sional lattices, such as the two shown in Fig. 1-10. 

To obtain an analytic expression for certain surface properties of crystals 
consisting of diatomic molecules (also called dimers ), one is required to find 
the number of ways in which all atoms on a two-dimensional lattice can be 
paired off as molecules (each consisting of two atoms). The problem is equiv¬ 
alent to finding all different coverings of a given graph such that every vertex 
in the covering is of degree one. Such a covering in which every vertex is of 
degree one is called a dimer covering or a 1-factor. A dimer covering is obvi¬ 
ously a matching because no two edges in it are adjacent. Moreover, a dimer 
covering is a maximal matching. This is why a dimer covering is often referred 
to as a perfect matching. 

Two different dimer coverings are shown in heavy lines in the graph in 
Fig. 8-13. 

Clearly, a graph must have an even number of vertices to have a dimer 
covering. This condition, however, is not enough (Problem 8-21). 




Fig. 8-13 Two dimer coverings of a 
graph. 


8-6. FOUR-COLOR PROBLEM 


So far we have considered proper coloring of vertices and proper coloring 
of edges. Let us briefly consider the proper coloring of regions in a planar 
graph (embedded on a plane or sphere). Just as in coloring of vertices and 
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edges, the regions of a planar graph are said to be properly colored if no two 
contiguous or adjacent regions have the same color. (Two regions are said to 
be adjacent if they have a common edge between them. Note that one or more 
vertices in common does not make two regions adjacent.) The proper coloring 
of regions is also called map coloring, referring to the fact that in an atlas 
different countries are colored such that countries with common boundaries 
are shown in different colors. 

Once again we are not interested in just properly coloring the regions of 
a given graph. We are interested in a coloring that uses the minimum number 
of colors. This leads us to the most famous conjecture in graph theory. The 
conjecture is that every map (i.e., a planar graph) can be properly colored 
with four colors. The four-color conjecture, already referred to in Chapter 1, 
has been worked on by many famous mathematicians for the past 100 years. 
No one has yet been able to either prove the theorem or come up with a map 
(in a plane) that requires more than four colors. 

That at least four colors are necessary to properly color a graph is im¬ 
mediate from Fig. 8-14, and that five colors will suffice for any planar graph 
will be shown shortly. 

Two remarks may be made here in passing. Paradoxically, for surfaces 
more complicated than the plane (or sphere) corresponding theorems have 
been proved. For example, it has been proved that seven colors are necessary 
and sufficient for properly coloring maps on the surface of a torus.t Second, 
it has been proved that all maps containing less than 40 regions can be proper¬ 
ly colored with four colors. Therefore, if in general the four-color conjecture 
is false, the counterexample has to be a very complicated and large one. 

Vertex Coloring Versus Region Coloring: From Chapter 5 we know that 
a graph has a dual if and only if it is planar. Therefore, coloring the regions 
of a planar graph G is equivalent to coloring the vertices of its dual G*, and 



fin fact, the Heawood map-coloring theorem gives the exact number of colors required 
for every orientable surface more complicated than that of a sphere. See page 136, [1-5], 
or page 94, [1-2]. 
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vice versa. Thus the four-color conjecture can be restated as follows: Every 
planar graph has a chromatic number of four or less. 

Five-Color Theorem: We shall now show that every planar map can be 
properly colored with five colors: 

Theorem 8-11 

The vertices of every planar graph can be properly colored with five colors. 

Proof: The theorem will be proved by induction. Since the vertices of all graphs 
(self-loop-free, of coursef) with 1, 2,3,4, or 5 vertices can be properly colored with 
five colors, let us assume that vertices of every planar graph with n — 1 vertices 
can be properly colored with five colors. Then, if we prove that any planar graph G 
with n vertices will require no more than five colors, we shall have proved the 
theorem. 

Consider the planar graph G with n vertices. Since G is planar, it must have 
at least one vertex with degree five or less (Problem 5-4). Let this vertex be v. 

Let G' be a graph (of n — 1 vertices) obtained from G by deleting vertex v (i.e., 
v and all edges incident on v). Graph G' requires no more than five colors, according 
to the induction hypothesis. Suppose that the vertices in G' have been properly 
colored, and now we add to it v and all edges incident on v. If the degree of v is 
1, 2, 3, or 4, we have no difficulty in assigning a proper color to v. 

This leaves only the case in which the degree of v is five, and all the five colors 
have been used in coloring the vertices adjacent to v, as shown in Fig. 8-15(a). 
(Note that Fig. 8-15 is part of a planar representation of graph G'.) 

Suppose that there is a path in G' between vertices a and c colored alternately 
with colors 1 and 3, as shown in Fig. 8-15(b). Then a similar path between b and 
d, colored alternately with colors 2 and 4, cannot exist; otherwise, these two paths 


Color 1 




Fig. 8-15 Reassigning of colors. 
fSee “Regularization of a Planar Graph” in this section. 
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will intersect and cause G to be nonplanar. (This is a consequence of the Jordan 
curve theorem, used in Section 5-3, also.) 

If there is no path between b and d colored alternately with colors 2 and 4, 
starting from vertex b we can interchange colors 2 and 4 of all vertices connected 
to b through vertices of alternating colors 2 and 4. This interchange will paint 
vertex b with color 4 and yet keep G' properly colored. Since vertex d is still with 
color 4, we have color 2 left over with which to paint vertex v. 

Had we assumed that there was no path between a and c of vertices painted 
alternately with colors 1 and 3, we would have released color 1 or 3 instead of 
color 2. And thus the theorem. ■ 


Regularization of a Planar Graph: Removing every vertex of degree one 
(together with the pendant edge) from the graph G does not affect the regions 
of a planar graph. Nor does the elimination of every vertex of degree two, by 
merging the two edges in series (Fig. 5-6), have any effect on the regions of a 
planar graph. 

Now consider a typical vertex v of degree four or more in a planar graph. 
Let us replace vertex v by a small circle with as many vertices as there were 
incidences on v. This results in a number of vertices each of degree three 
(see Fig. 8-16). 

Performing this transformation on every vertex of degree four or more in 
a planar graph G will produce another planar graph H in which every vertex 
is of degree three. When the regions of H have been properly colored, a proper 
coloring of the regions of G can be obtained simply by shrinking each of the 
new regions back to the original vertex. 

Such a transformation may be called regularization of a planar graph, 
because it converts a planar graph G into a regular planar graph H of degree 
three. Clearly, if H can be colored with four colors, so can G. Thus, for map¬ 
coloring problems, it is sufficient to confine oneself to (connected) planar, 
regular graphs of degree three. And the four-color conjecture may be restated 
as follows: 



(a) (b) 

Fig. 8-16 Regularization of a graph. 
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The regions of every planar, regular graph of degree three can be colored 
properly with four colors. 

If, in a planar graph G, every vertex is of degree three, its dual G* is a 
planar graph in which every region is bounded by three edges; that is, G* is 
a triangular graph. Thus the four-color conjecture may again be restated as 
follows: The chromatic number of every triangular, planar graph is four or 
less. 


SUMMARY 

In the first three sections of this chapter, we were concerned with proper 
coloring of the vertices of a graph. This led us to the chromatic partitioning 
of the vertices. In the process we also developed the concept of an indepen¬ 
dent set of vertices and a dominating set of vertices. Associated in a natural 
way with these sets we found maximal independent sets, largest maximal 
independent sets, and the independence number; minimal dominating sets, 
smallest minimal dominating sets, and the domination number. The chro¬ 
matic polynomial, studied in Section 8-3, was also a direct consequence of 
proper coloring of vertices. 

Sections 8-4 and 8-5 contain developments parallel to those in Sections 
8-1 and 8-2, except that Sections 8-4 and 8-5 are concerned with proper color¬ 
ing of the edges of a graph rather than the vertices. A matching is an inde¬ 
pendent set of edges, that is, a set of edges no two of which are adjacent. A 
maximal matching is a maximal set of independent edges. An edge covering is 
somewhat similar to a dominating set of edges in the sense that every edge in 
the graph is either in a covering or is adjacent to it. A dimer covering is a 
perfect matching. An independent set is also a dominating set if and only if 
it is maximal. Likewise, a matching is also a covering if and only if it is per¬ 
fect. The sketch in Fig. 8-17 summarizes the relationships among these con¬ 
cepts. The arrows indicate the direction of increasing restriction. 

The last section deals with proper coloring of regions in a planar graph 
rather than vertices or edges. 
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(b) Edge Subsets 
Fig. 8-17 Structure of terms. 






192 


COLORING, COVERING, AND PARTITIONING 


CHAP. 8 


early history of the four-color problem. For an entire book devoted to this unsolved 
problem, see Ore [1-11], 

8-1. Brooks, R. L., “On Coloring the Nodes of a Network,” Proc. Cambridge Phil. Soc., 
Vol., 37, 1941, 194-197. 

8-2. Kasteleyn, P. W., “Graph Theory and Crystal Physics,” in Graph Theory and 
Theoretical Physics (F. Harary, ed.). New York, Academic Press, Inc., 1967, 43-110. 
8-3. Liu, C. L., Introduction to Combinatorial Mathematics, McGraw-Hill Book Com¬ 
pany, New York, 1968. 

8-4. Mirsky, L., and H. Perfect, “Systems of Representatives,” J. Math. Anal. Appl., 
Vol. 3, 1966, 520-568. 

8-5. Read, R. C., “An Introduction to Chromatic Polynomials,” J. Combinatorial Theory, 
Vol. 4, No. 1, 1968, 52-71. 

8-6. Wilkov, R. S., and W. H. Kim, “A Practical Approach to the Chromatic Partition 
Problem,” J. Franklin Inst., Vol. 289, No. 5, May 1970, 333-349. 

PROBLEMS 

8-1. Prove that the chromatic number of a graph will not exceed by more than one 
the maximum degree of the vertices in a graph. 

8-2. Show that the graph in Fig. 8-4 has only one chromatic partition. What is it? 

8-3. Show that the chromatic number of a graph G cannot exceed the diameter (i.e., 
the length of the longest path) of G by more than one. 

8-4. Show that a simple graph with n vertices and more than [_« 2 /4J edges cannot be a 
bipartite graph. 

8-5. A bipartite graph is said to be a complete bipartite graph if there is one edge between 
every vertex of set V\ to every vertex of set V 2 . Show that the maximum number 
of edges in a complete bipartite graph of n vertices is |_« 2 /4J. 

8-6. Show that if a bipartite graph has any circuits, they must all be of even lengths. 

8-7. Tn a chessboard, show the positions of 

(a) The minimum number of queens that collectively dominate all 64 squares (an 
example of a minimal dominating with smallest number of vertices). 

(b) The maximum number of queens such that none of them can take another 
(an example of a maximal independent set with largest number of vertices). 

8-8. Find the chromatic polynomial of the graph in Fig. 8-7. 

8-9. Using induction on n, prove Theorem 8-5. {Hint: Use a technique similar to one 
used in proving Theorem 3-3.) 

8-10. Show that the chromatic polynomial of a graph of n vertices satisfies the inequality 

F„(A)<:A(A - I)"-'. 

{Hint: Use Theorem 8-5.) 

8-11. Show that the chromatic polynomial of a graph consisting of a single circuit of 
length n (i.e., an n-gon) is 

P„{X) = {\ - 1)« + (A — l)(-l) n . 

8-12. Show that the absolute value of the second coefficient of A” -1 in the chromatic 
polynomial P n { A) of a graph equals the number of edges in the graph. 

8-13. Sketch two different (i.e., nonisomorphic) graphs that have the same chromatic 
polynomial. 
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8-14. Suppose that you are required to make a class schedule in a university. There are 
a total of n courses to be taught in m available hours of the week. There are pairs 
of courses that cannot be taught at the same time because some students might 
like to take both. Explain how you will make the schedule. State the condition 
when it will be impossible to make a compatible schedule. {Hint: Try properly 
coloring n vertices with m available colors.) 

8-15. Sketch a bipartite graph that does not satisfy the condition in Theorem 8-8 and 
yet has a complete matching. 

8-16. In a village there are an equal number of boys and girls of marriageable age. Each 
boy dates a certain number of girls and each girl dates a certain number of boys. 
Under what condition is it possible that every boy and girl gets married to one of 
their dates? (Polygamy and polyandry not allowed.) 

8-17. Make a complete table (like Table 8-1) for the graph of Fig. 8-6 to determine 
whether or not a complete matching exists. Find the deficiency number from this 
table. 

8-18. Show that a nonnull graph is 2-chromatic if and only if, for all odd r, every diagonal 
entry in matrix X r is zero. The matrix X is the adjacency matrix of the graph. 
{Hint: Use Theorem 8-2.) 

8-19. Just as with an independent set of vertices (or simply independent set), define an 
independent set of edges in a graph as a set of nonadjacent edges (not incident on 
a common vertex). Make some observations parallel to those in Section 8-2. 
Observe that matching is an independent set of edges. What are complete matchings, 
maximal matchings, and so on? 

8-20. Explore how the covering number of a graph G with n vertices is related to the 
diameter of G. 

8-21. Sketch a graph with an even number of vertices that has no dimer covering. 

8-22. Show that the regions of a simple planar graph G can be colored properly with 
two colors if and only if every vertex in G is of even degree. {Hint: Use Theorem 
8-2 and Problem 4-28.) 

8-23. From v distinct objects one can select v\/[k\{v — &)!] combinations of k objects. 
Two such combinations are m-related if they have m or more objects in common, 
m<k. This relationship can be expressed by means of a graph with v\/[k\{v — fc)!] 
vertices and with edges between every pair of m-related combinations. Make obser¬ 
vations on the properties of such a graph. Give conditions for which this graph is 
(a) a null graph; (b) a complete graph. How will you select a largest set of com¬ 
binations that are not m-related? Illustrate your method by sketching the graph 
for v = 6, k = 3, and m = 2. 

8-24. An N by N square in which objects a\, az,..., on are arranged in such a way that 
each object appears exactly once in each row and exactly once in each column is 
called a Latin square. A Latin square can also be represented by a complete bipartite 
graph of 2N vertices. What is the total number of different matchings in such a 
graph? How many of these matchings are edge disjoint? 

8-25. Call a subset of vertices that includes at least one vertex incident on every edge of G 
a vertex cover of G. Show that the number of vertices in the smallest vertex cover is 
equal to or less than the domination number of G. 



DIRECTED GRAPHS 


The graphs studied in this book so far have been undirected graphs. No 
direction was assigned to the edges in a graph. An edge e k between vertices 
v t and Vj could be considered as going from vertex v i to vertex v s or from v } to 
v t . In this chapter we shall consider directed graphs—graphs in which edges 
have directions. 

Many physical situations require directed graphs. The street map of a city 
with one-way streets, flow networks with valves in the pipes, and electrical 
networks, for example, are represented by directed graphs. Directed graphs 
are employed in abstract representations of computer programs, where the 
vertices stand for the program instructions and the edges specify the execution 
sequence. The directed graph is an invaluable tool in the study of sequential 
machines. Directed graphs in the form of signal-flow graphs are used for 
system analysis in control theory. 

Most of the concepts and terminology of undirected graphs are also 
applicable to directed graphs. For example, the planarity of a graph does not 
depend on whether the graph is directed or undirected, and therefore Chapter 
5 is applicable to both directed and undirected graphs. The same is true for 
most other topics covered so far. It would be wasteful to devote another eight 
chapters to the study of directed graphs, mostly repeating, with minor 
changes, what has already been said. In this chapter, therefore, we shall 
mainly bring out those properties of directed graphs that are not shared by 
undirected graphs. 

9-1. WHAT IS A DIRECTED GRAPH? 

A directed graph (or a digraph for short) G consists of a set of vertices 
V — {v x , v 2 , . . .], a set of edges E — [e x , e 2 , . . .}, and a mapping that maps 
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Fig. 9-1 Directed graph with 5 vertices 
and 10 edges. 

every edge onto some ordered pair of vertices {v., Vj). As in the case of 
undirected graphs, a vertex is represented by a point and an edge by a line 
segment between v. and v s with an arrow directed from v. to v r For example, 
Fig. 9-1 shows a digraph with five vertices and ten edges. A digraph is also 
referred to as an oriented graphA 

In a digraph an edge is not only incident on a vertex, but is also incident 
out of a vertex and incident into a vertex. The vertex v., which edge e k is 
incident out of, is called the initial vertex of e k . The vertex v jy which e k is 
incident into, is called the terminal vertex of e k . In Fig. 9-1, v $ is the initial 
vertex and v A is the terminal vertex of edge e 7 . An edge for which the initial 
and terminal vertices are the same forms a self-loop, such as e 5 . (Some 
authors reserve the term arc for an oriented or directed edge. We use the 
term edge to mean either an undirected or a directed edge. Whenever there is 
a possibility of confusion, we shall explicitly state directed or undirected edge.) 

The number of edges incident out of a vertex v t is called the out-degree (or 
out-valence or outward demidegree) of v ; and is written d + (v t ). The number of 
edges incident into v. is called the in-degree (or in-valence or inward demi¬ 
degree) of v t and is written as d~{vf In Fig. 9-1, for example, 

d + iv ,) = 3, d-(v y ) = 1, 

d + (v 2 ) = 1, d~(v 2 ) = 2, 

d + (v 5 ) = 4, d~(v 5 ) = 0. 

It is not difficult to prove (Problem 9-1) that in any digraph G the sum of 
all in-degrees is equal to the sum of all out-degrees, each sum being equal to 

fSome authors make a distinction between the terms “oriented graph” and “directed 
graph” by reserving the former for only those digraphs which have at most one 
directed edge between a pair of vertices. This often leads to confusion; therefore, we use 
these two terms synonymously. 
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the number of edges in G ; that is, 

S d+ ( v <) = E 

i=i i=i 

An isolated vertex is a vertex in which the in-degree and the out-degree 
are both equal to zero. A vertex v in a digraph is called pendant if it is of 
degree one, that is, if 


d + (v) + d-(v)= 1. 

Two directed edges are said to be parallel if they are mapped onto the 
same ordered pair of vertices. That is, in addition to being parallel in 
the sense of undirected edges, parallel directed edges must also agree in the 
direction of their arrows. In Fig. 9-1, edges e 8 , e 9 , and e 10 are parallel, 
whereas edges e 2 and e 3 are not. 

Since many properties of directed graphs are the same as those of 
undirected ones, it is often convenient to disregard the orientations of edges 
in a digraph. Such an undirected graph obtained from a directed graph G will 
be called the undirected graph corresponding to G. 

On the other hand, given an undirected graph H, we can assign each edge 
of H some arbitrary direction. The resulting digraph, designated by H is 
called an orientation of H (or a digraph associated with H ). Note that while a 
given digraph has a unique (within isomorphism) undirected graph corre¬ 
sponding to it, a given undirected graph may have ’’different" orientations 
possible. This is why we say the undirected graph corresponding to a digraph, 
but an orientation of a graph. 

This brings us to the question: When are two digraphs considered to be 
the same or isomorphic? 

Isomorphic Digraphs: Isomorphic graphs were defined such that they have 
identical behavior in terms of graph properties. In other words, if their labels 
are removed, two isomorphic graphs are indistinguishable. For two digraphs 



Fig. 9-2 Two nonisomorphic digraphs. 
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to be isomorphic not only must their corresponding undirected graphs be 
isomorphic, but the directions of the corresponding edges must also agree. 
For example, Fig. 9-2 shows two digraphs that are not isomorphic, although 
they are orientations of the same undirected graph. 

Figure 9-2 immediately suggests a problem. What is the number of 
distinct (i.e., nonisomorphic) orientations of a given undirected graph? The 
problem was solved by F. Harary and E. M. Palmer in 1966. Some specific 
cases are left as an exercise (Problem 9-3). 

9-2. SOME TYPES OF DIGRAPHS 

Like their undirected sisters, digraphs come in many varieties. In fact, due 
to the choice of assigning a direction to each edge, directed graphs have more 
varieties than undirected ones. 

Simple Digraphs: A digraph that has no self-loop or parallel edges is 
called a simple digraph (Figs. 9-2 and 9-3, for example). 

Asymmetric Digraphs: Digraphs that have at most one directed edge 
between a pair of vertices, but are allowed to have self-loops, are called 
asymmetric or antisymmetric. 

Symmetric Digraphs: Digraphs in which for every edge ( a , b) (i.e., from 
vertex a to b) there is also an edge ( b , a). 

A digraph that is both simple and symmetric is called a simple symmetric 
digraph. Similarly, a digraph that is both simple and asymmetric is simple 
asymmetric. The reason for the terms symmetric and asymmetric will be 
apparent in the context of binary relations in Section 9-3. 

Complete Digraphs: A complete undirected graph was defined as a simple 
graph in which every vertex is joined to every other vertex exactly by one edge. 
For digraphs we have two types of complete graphs. A complete symmetric 
digraph is a simple digraph in which there is exactly one edge directed from 
every vertex to every other vertex (Fig. 9-3), and a complete asymmetric 
digraph is an asymmetric digraph in which there is exactly one edge between 
every pair of vertices (Fig. 9-2). 

A complete asymmetric digraph of n vertices contains n(n — l)/2 edges, 
but a complete symmetric digraph of n vertices contains n(n — 1) edges. A 
complete asymmetric digraph is also called a tournament or a complete 
tournament (the reason for this term will be made clear in Section 9-10). 

A digraph is said to be balanced if for every vertex v t the in-degree equals 
the out-degree; that is, d + (v t ) = d~{v^). (A balanced digraph is also referred 
to as a pseudosymmetric digraph, or an isograph .) A balanced digraph is said 
to be regular if every vertex has the same in-degree and out-degree as every 
other vertex. 
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Fig. 9-3 Complete symmetric digraph 
of four vertices. 


9-3. DIGRAPHS AND BINARY RELATIONS 

The theory of graphs and the calculus of binary relations are closely 
related (so much so that some people often mistakenly come to regard graph 
theory as a branch of the theory of relations). 

In a set of objects, X , where 

X = (x,,x 2 ,. . 

a binary relation R between pairs (x,., x } ) may exist. In which case, we write 

x ; Rxj 

and say that x,. has relation R to x r 

Relation R may for instance be “is parallel to,” “is orthogonal to,” or 
“is congruent to” in geometry. It may be “is greater than,” “is a factor of,” 
“is equal to,” and so on, in the case when X consists of numbers. On the other 
hand, if the set X is composed of people, the relation R may be “is son of,” 
“is spouse of,” “is friend of,” and so forth. Each of these relations is defined 
only on pairs of numbers of the set, and this is why the name binary relation. 
Although there are relations other than binary (x,. “is a product of” x y and 
x k , for example, will be a tertiary relation), binary relations are the most 
important in mathematics, and the word “relation” implies a binary relation. 

A digraph is the most natural way of representing a binary relation on a 
set X. Each x,. e X is represented by a vertex x r If x t has the specified relation 
R to Xj, a directed edge is drawn from vertex x,. to x jy for every pair (x,., x y ). 
For example, the digraph in Fig. 9-4 represents the relation “is greater than” 
on a set consisting of five numbers (3, 4, 7, 5, 8}. 

Clearly, every binary relation on a finite set can be represented by a 
digraph without parallel edges. Conversely, every digraph without parallel 
edges defines a binary relation on the set of its vertices. 
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Fig. 9-5 Graphs of symmetric binary relation. 

Reflexive Relation: For some relation R it may happen that every element 
is in relation R to itself. For example, a number is always equal to itself, or a 
line is always parallel to itself. Such a relation R on set X that satisfies 


Xi RX; 

for every x. e X is called a reflexive relation. The digraph of a reflexive 
relation will have a self-loop at every vertex. Such a digraph representing a 
reflexive binary relation on its vertex set may be called a reflexive digraph. A 
digraph in which no vertex has a self-loop is called an irreflexive digraph. 

Symmetric Relation: For some relation R it may happen that for all x ( . and 
Xj, if 

x ( RXj holds, then XjRx t also holds. 

Such a relation is called a symmetric relation. “Is spouse of” is a symmetric 
but irreflexive relation. “Is equal to” is both symmetric and reflexive. 

The digraph of a symmetric relation is a symmetric digraph because for 
every directed edge from vertex x f to x y there is a directed edge from Xj to x,.. 
Figure 9-5(a) shows the graph of an irreflexive, symmetric binary relation on 
a set of four elements. The same relation can also be represented by drawing 
just one undirected edge between every pair of vertices that are related, as in 
Fig. 9-5(b). Thus every undirected graph is a representation of some sym- 
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metric binary relation (on the set of its vertices). Furthermore, every 
undirected graph with e edges can be thought of as a symmetric digraph with 
2e directed edges. (A two-way street is equivalent to two one-way streets 
pointed in opposite directions.) 

Transitive Relation: A relation R is said to be transitive if for any three 
elements jc,., Xj, and x k in the set, 


always imply 


X;Rxj and XjRx k 


x.Rx, 


The binary relation “is greater than,” for example, is a transitive relation. If 
x t > Xj and x } > x k , clearly jc. > x k . “Is descendent of” is another example 
of a transitive relation. 

The digraph of a transitive (but irreflexive and asymmetric) binary relation 
is shown in Fig. 9-4. Note the triangular subgraphs. A digraph representing a 
transitive relation (on its vertex set) is called a transitive directed graph. 

Equivalence Relation: A binary relation is called an equivalence relation if 
it is reflexive, symmetric, and transitive. Some examples of equivalence 
relations are “is parallel to,” “is equal to,” “is congruent to,” “is equal to 
modulo m,” and “is isomorphic to.” 

The graph representing an equivalence relation may be called an equiva¬ 
lence graph. What does an equivalence graph look like? Let us look at an 
example, consisting of the equivalence relation “is congruent to modulo 3” 
defined on the set of 11 integers, 10 through 20. The graph is shown in Fig. 
9-6. (Recall that each undirected edge in Fig. 9-6 represents two parallel but 
oppositely directed edges.) 

In Fig. 9-6 we see that the vertex set of the graph is divided into three 
disjoint classes, each in a separate component. Each component is an 
undirected subgraph (due to symmetry) with a self-loop at each vertex (due to 
reflexivity). Furthermore, in each component every vertex is related to (i.e., 
joined by an edge to) every other vertex. 



= 2 (mod 3) 

Fig. 9-6 Equivalence graph. 


= 1 (mod 3) 


= 0 (mod 3) 
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In general, an equivalence relation on a set partitions the elements of the 
set into classes (called equivalence classes) such that two elements are in the 
same class if and only if they are related. Symmetry ensures that there is no 
ambiguity regarding membership in the equivalence class; otherwise, x t may 
have been related to Xj but not vice versa. Transitivity ensures that in each 
component every vertex is joined to every other vertex, because if a is related 
to b and b is related to c, a is also related to c. Transitivity also guarantees 
that no element can be in more than one class. Reflexivity allows an element 
to be in a class by itself, if it is not related to any other element in the set. 

Relation Matrices: A binary relation R on a set can also be represented by 
a matrix, called a relation matrix. It is a (0, 1), n by n matrix, where n is the 
number of elements in the set. The /, /th entry in the matrix is 1 if x t Rxj is 
true, and is 0, otherwise. For example, the relation matrix of the relation “is 
greater than” on the set of integers [3, 4, 7, 5, 8} is 


3 4 

3 [0 0 

4 1 0 

7 1 1 

5 1 1 

8 1 1 


7 

0 

0 

0 

0 

1 


5 

0 

0 

1 

0 

1 


8 

0 

0 

0 

0 

0 


We shall see in Section 9-8 that this is precisely the adjacency matrix of the 
digraph representing the binary relation. 


9-4. DIRECTED PATHS AND CONNECTEDNESS 

Walks, paths, and circuits in a directed graph, in addition to being what 
they are in the corresponding undirected graph, have the added consideration 
of orientation. For example, in Fig. 9-1, the sequence of vertices and edges 
v 5 e s v 3 e 6 v \ e 3 v i is a path “directed” from v 5 to v,, whereas v 5 e 7 v A e 6 v 3 <?, v y 
(although a path in the corresponding undirected graph) has no such consis¬ 
tent direction from v $ to v r A distinction must be made between these two 
types of paths. It is natural to call the first one a directed path from v 5 to v u 
and the second one a semipath. The word “path” in a digraph could mean 
either a directed path or a semipath, and similarly for walks, circuits, and 
cutsets. More precisely: 

A directed walk from a vertex v t to Vj is an alternating sequence of vertices 
and edges, beginning with v i and ending with Vj, such that each edge is 
oriented from the vertex preceding it to the vertex following it. Of course, no 
edge in a directed walk appears more than once, but a vertex may appear 
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more than once, just as in the case of undirected graphs. A semiwalk in a 
directed graph is a walk in the corresponding undirected graph, but is not 
a directed walk. A walk in a digraph can mean either a directed walk or a 
semiwalk. 

The definitions of circuit, semicircuit, and directed circuit can be written 
similarly. Let us turn to Fig. 9-1 once more. The set of edges {e 1 ,e 6 , e 3 ] is a 
directed circuit. But fc,, e 6 , e 2 j is a semicircuit. Both of them are circuits. 

Connected Digraphs: In Chapter 2 a graph (i.e., undirected graph) was 
defined as connected if there was at least one path between every pair of 
vertices. In a digraph there are two different types of paths. Consequently, we 
have two different types of connectedness in digraphs. A digraph G is said to 
be strongly connected if there is at least one directed path from every vertex 
to every other vertex. A digraph G is said to be weakly connected if its corre¬ 
sponding undirected graph is connected but G is not strongly connected. In 
Fig. 9-2 one of the digraphs is strongly connected, and the other one is weakly 
connected. The statement that a digraph G is connected simply means that its 
corresponding undirected graph is connected; and thus G may be strongly or 
weakly connected. A directed graph that is not connected is dubbed as 
disconnected. 

Since there are two types of connectedness in a digraph, we can define two 
types of components also. Each maximal connected (weakly or strongly) 
subgraph of a digraph G will still be called a component of G. But within each 
component of G the maximal strongly connected subgraphs will be called the 
fragments (or strongly connected fragments) of G. 

For example, the digraph in Fig. 9-7 consists of two components. The 
component g, contains three fragments (c,, e 2 }, {e 5 , e 6 , e 7 , e 8 }, and {c 10 }. 
Observe that e 3 , e 4 , and e 9 do not appear in any fragment of g t . 



Fig. 9-7 Disconnected digraph with two components. 
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Fig. 9-8 Condensation of Fig. 9-7. 


Condensation: The condensation G c of a digraph G is a digraph in which 
each strongly connected fragment is replaced by a vertex, and all directed 
edges from one strongly connected component to another are replaced by a 
single directed edge. The condensation of the digraph G in Fig. 9-7 is shown 
in Fig. 9-8. 

Two observations can be made from the definition: 


1. The condensation of a strongly connected digraph is simply a vertex. 

2. The condensation of a digraph has no directed circuit. 

Accessibility: In a digraph a vertex b is said to be accessible (or reachable) 
from vertex a if there is a directed path from a to b. Clearly, a digraph G is 
strongly connected if and only if every vertex in G is accessible from every 
other vertex. 


9-5. EULER DIGRAPHS 

The notion of the Euler graph can be extended to digraphs also. In a 
digraph G a closed directed walk (i.e., a directed walk that starts and ends at 
the same vertex) which traverses every edge of G exactly once is called a 
directed Euler line. A digraph containing a directed Euler line is called an 
Euler digraph. The graph in Fig. 9-9 is an Euler digraph, in which the walk 
a b c d e /is an Euler line. 

When is a digraph an Euler digraph? Clearly, the digraph must be 
connected, with the possible exception of isolated vertices; otherwise, every 
edge cannot be traversed in one walk. In fact, an Euler digraph must be 
strongly connected, although every strongly connected digraph need not be 
an Euler digraph (Problem 9-13). Theorem 9-1 (whose proof follows the 
proof of Theorem 2-4 almost verbatim) provides a very simple test for 
determining whether or not a digraph has an Euler line. 
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Theorem 9-1 

A digraph G is an Euler digraph if and only if G is connected and is balanced 
[i.e., d~(v) = d + (v) for every vertex v in G]. 

Let us now consider an application of the Euler digraph for solving an 
important problem in communication theory. The problem, which is often 
referred to as the teleprinter's problem , was solved in 1940 by I. G. Good 
using the digraph, and was presented in a classic paper in 1946 [9-2]. 

Teleprinter’s Problem: How long is a longest circular (or cyclic) sequence 
of l’s and 0’s such that no subsequence of r bits appears more than once in 
the sequence? Construct one such longest sequence. 

Solution: Since there are 2 r distinct r-tuples formed from 0 and 1, the 
sequence can be no longer than 2 r bits long. Using Theorem 9-1, we shall 
construct a circular sequence 2 r bits long with the required property that no 
subsequence of r bits be repeated. 

Construct a digraph G whose vertices are all (r — l)-tuples of 0’s and l’s. 
Clearly, there are 2 r_1 vertices in G. Let a typical vertex be 

«,a 2 ... tx r - ,, where a, = 0 or 1. 

Draw an edge directed from this vertex (a, a 2 . . . a r _i) to each of two 
vertices (a 2 a 3 . . . a r _i0) and (a 2 a 3 . . . a r _il); label these directed edges 
a, a 2 . . . a r _i0 and a, a 2 . . . a r _! 1, respectively. Draw two such edges directed 
from each of the 2 r_1 vertices. (A self-loop will result in each of the two cases 
when a, = a 2 = • • • = a r _i = 0 or 1.) 

The resulting digraph is an Euler digraph because for each vertex the 
in-degree equals the out-degree (each being equal to two). A directed Euler 
line in G consists of the 2 r edges, each with a distinct r-bit label. The labels of 
any two consecutive edges in the Euler line are of the form a, a 2 . . . a r _i a r ; 
a 2 a 3 . .. <x r a r+1 ; that is, the r — 1 trailing bits of the first edge are identical 
to the r — 1 leading bits of the second edge. Thus in the sequence of 2 r bits, 
made of the first bit of each of the edges in the Euler line, every possible 



sec. 9-5 


EULER DIGRAPHS 


205 




Fig; 9-10 Euler digraph for maximum-length sequence. 


subsequence of r bits occurs as the label of an edge; and since no two edges 
have the same label, no subsequence occurs more than once. The circular 
arrangement is achieved by joining the two ends of the sequence.) 

For r = 4, the graph in Fig. 9-10 illustrates the procedure of obtaining 
such a maximum-length sequence. One such sequence is 0000101001101111 
corresponding to the walk e y e 2 e 3 e 4 e 5 e 6 e n e s e 9 e 10 e u e 12 e 13 e i4 e 15 e l6 . 

This problem is also encountered in locating the position of a rotating 
drum (Problem 9-19) with the surface carrying two different types of marks. 
The problem of the rotating drum is similar to that of a feedback shift register 
(to be discussed in Chapter 12). The quest for a word in which each arrange¬ 
ment of r letters of a given alphabet appears exactly once, encountered in 
cryptography, is also the same problem (Problem 9-20). 

It is not difficult to see that the alphabet size need not be 2. It could be any 
number m. In that case, the maximum-length sequence is m r symbols long. 
The in-degree and out-degree of each vertex in the corresponding Euler 
digraph equals m, rather than two. 

Number of Euler Lines: In Fig. 9-10 there is more than one Euler line. In 
fact, the digraph has 16 distinct Euler lines. (Note that rotations of the same 
sequence of edges are not considered distinct.) Finding the number of distinct 
directed Euler lines in a given Euler digraph is also of interest in many 
applications. This problem of enumeration was solved by N. G. deBruijn in 
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1946 for those regular Euler digraphs in which the in-degree and out-degree 
of every vertex were exactly two, the digraph in Fig. 9-10, for example. In 
1951 van Aardenne-Ehrenfest and deBruijn [9-9] solved the more general 
problem of counting the number of distinct directed Euler lines in any Euler 
digraph. 

The number of distinct Euler lines in a balanced, connected digraph G can 
be obtained by counting certain types of spanning trees in G , to be covered in 
the next section. 


9-6. TREES WITH DIRECTED EDGES 

A tree (for undirected graphs) was defined as a connected graph without 
any circuit. The basic concept as well as the term “tree” remains the same for 
digraphs. A tree is a connected digraph that has no circuit—neither a 
directed circuit nor a semicircuit. A tree of n vertices contains n — 1 directed 
edges and has properties similar to those with undirected edges. Trees with 
directed edges are of great importance in many applications, such as electrical 
network analysis, game theory, theory of languages, computer programming, 
and counting problems, to name a few. 

In addition to being trees in the undirected sense, trees in digraphs have 
additional properties and variations resulting from the relative orientations 
of the edges. One such particularly useful type of rooted tree with directed 
edges is called an arborescence and is defined as follows: 

Arborescence: A digraph G is said to be an arborescence if 

1. G contains no circuit—neither directed nor semicircuit. 

2. In G there is precisely one vertex v of zero in-degree. 

This vertex v is called the root of the arborescence. An arborescence is shown 
in Fig. 9-11. 



Fig. 9-11 Arborescence. 
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Theorem 9-2 

An arborescence is a tree in which every vertex other than the root has an 
in-degree of exactly one. 

Proof: An arborescence with n vertices can have at most n — 1 edges because 
of condition 1. Therefore, the sum of in-degrees of all vertices in G 

d~(v x ) + d~(v 2 ) + • • • + d~(v n ) < n — 1. 

Of the n terms on the left-hand side of this equation, only one is zero because of 
condition 2; others must all be positive integers. Therefore, they must all be l’s. 
Now, since there are exactly n — 1 vertices of in-degree one and one vertex of 
in-degree zero, digraph G has exactly n — 1 edges. Since G is also circuitless, it 
must be connected, and hence a tree. ■ 

An arborescence is in a sense a tree directed out of the root. Therefore, an 
arborescence is sometimes referred to as an out-tree. (Reversing the direction 
of every edge in an arborescence will produce what may be called an in-tree.) 

Theorem 9-3 

In an arborescence there is a directed path from the root R to every other 
vertex. Conversely, a circuitless digraph G is an arborescence if there is a vertex 
v in G such that every other vertex is accessible from v, and v is not accessible from 
any other vertex. 

Proof: (a) In an arborescence consider a directed path P starting from the root 
R and continuing as far as possible. P can end only at a pendant vertex; otherwise, 
we get a vertex whose in-degree is two or more. A contradiction. 

Since an arborescence is connected, every vertex lies on some directed path 
from the root R to each of the pendant vertices. 

(b) Conversely, since every vertex in G is accessible from v, and G has no 
circuit, G is a tree. Moreover, since v is not accessible from any other vertex, 
d~(v ) = 0. Every other vertex is accessible from v, and therefore the in-degree of 
each of these vertices must be at least one. The in-degree cannot be greater than 
one because there are only n — 1 edges in G ( n being the number of vertices in 

G). m 


The following is an important application of arborescences to the theory 
of computer algorithms. 

Polish Notation: Consider the arithmetic expression 


a + b 


C'd 

g x ~f 


(9-1) 


In a procedural language (such as FORTRAN or ALGOL) this expression 
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might be written as 


© 

1 

* 

• 

G 

*C> 

+ 

o 

(9-2) 

where | denotes exponentiation. 



In evaluating this expression the computer must perform the arithmetic 
operations in a certain order; otherwise, it will produce a wrong result. Let us 
number the operations in this expression in the order in which they might be 
performed. 

a + b — c-d -E (g t x — /). (9-3) 

© © © © © © 

To evaluate such an expression, the machine will have to scan the expres¬ 
sion back and forth to find the sequence of operations to be performed. 

To avoid scanning back and forth, the computer makes a preliminary 
translation of expressions such as (9-2) into the Polish notation (invented by 
the Polish logician, Lukasiewicz). Polish notation is also called parenthesis- 
free notation , because it contains no parentheses. This notation has the 
advantage that the operations appear exactly in the same order as they are 
performed. 

The basic idea in Polish notation is that a binary operator appears just to 
the left of the two operands rather than in between the two operands. Thus 
* -f y is written as + xy. The translation of expressions from procedural 
language into Polish notation is extremely important in compiling and can be 
accomplished by first representing the given expression by means of an 
arborescence as follows: Each variable (or constant) appearing in the expres¬ 
sion is represented by a pendant vertex. Each internal vertex represents a 
binary operator having the two subarborescences as its operands. An arbo¬ 
rescence for expression (9-2) is shown in Fig. 9-12. 

To obtain the expression in Polish notation, we traverse the arborescence 



Fig. 9-12 Arborescence for 
a + b — c • d + (g\ x —f). 
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starting from the root from left to right and from top to bottom, as indicated 
by the dotted line in Fig. 9-12. Each time we come across a vertex that has not 
been traversed before, we append its label to the existing string. This process 
in Fig. 9-12 yields 

a — b ~ -cd — \ gxf. (9-4) 

An expression in Polish notation is evaluated as follows: We start at the 
right extreme and move to the left. Whenever an operator is encountered the 
operation is performed between the two operands immediately to the right of 
it. After an operation is performed, the resultant is regarded as one operand 
for the next operation. You can verify that under this procedure expression 
(9-4) is equal to (9-2). The advantage of expression (9-4) over (9-2) is that in 
(9-4) there are no parentheses and the operators appear in the order (from 
right to left) in which they are to be acted upon. Therefore, no back and 
forth scanning is required during the computation. 

Ordered Trees: You must have noticed that in the expression arborescence 
of Fig. 9-12 the relative positions of the vertices in the plane of the paper—- 
left or right, up or down—are important and must be preserved. In this 
sense it is a “rigid” graph, and a graph isomorphic to it may not preserve its 
properties. This in fact is not purely a graph-theoretic problem; and this is 
the first and only time in this book we shall consider such a structure. 

In computer literature a tree in which the relative order of subtrees 
meeting at each vertex must be preserved is called an ordered tree or a planar 
tree (because the tree can be visualized as rigidly embedded in the plane of the 
paper). In computer science the term tree usually means an ordered tree, and 
by convention a tree is drawn hanging down with the root at the top. 

Spanning Arborescence: A spanning tree in an w-vertex connected digraph, 
analogous to a spanning tree in an undirected graph, consists of n — 1 
directed edges. A spanning arborescence in a connected digraph is a spanning 
tree that is an arborescence. For example, a spanning arborescence in Fig. 
9-13 is {/, b, d). There is a striking relationship between a spanning arbo¬ 
rescence and an Euler line. This is brought out by Theorems 9-4 and 9-5. 
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Theorem 9-4 

In a connected, balanced digraph G of n vertices and m edges, let W = (e t , e 2 , 
. . . , e m ) be an Euler line, which starts and ends at a vertex v (i.e., v is the initial 
vertex of e x and the terminal vertex of e m ). Among the m edges in W there are 
n — 1 edges that “enter” each of n — 1 vertices, other than v, for the first time. 
The subdigraph g of these n — 1 directed edges together with the n vertices is a 
spanning arborescence of G, rooted at vertex v. 

Illustration: Before proving the theorem, let us look at an example. In 
Fig. 9-13, W=(bdcefgh a) is an Euler line, starting and ending at vertex 
2. The subdigraph { b , d, /} is a spanning arborescence rooted at vertex 2. 

Proof: In the subdigraph g, vertex v is of in-degree zero, and every other 
vertex is of in-degree one; for g includes exactly one edge going to each of the 
n — 1 vertices, and no edge going to v. Moreover, the way g is defined in W, 
g is connected and contains n — 1 directed edges. Therefore, g is a spanning 
arborescence in G and is rooted at v. ■ 

Theorem 9-4 provides a method of obtaining a spanning arborescence 
rooted at any specified vertex, provided the digraph is Eulerian. Conversely, 
given a spanning arborescence in an Euler digraph, an Euler line can be 
constructed using Theorem 9-5. This important result discovered by T. van 
Aardenne-Ehrenfest and N. G. deBruijn in 1951 is used in counting the num¬ 
ber of distinct Euler lines. 

For the sake of traversing the edges along with rather than opposite to the 
direction of edges, it is better to express Theorem 9-5 in terms of an in-tree, 
that is, an arborescence in which the direction of every edge has been inverted. 
In Fig. 9-14 the subdigraph {e 2 , e 3 , e 7 , e J0 , e x ,} is a spanning in-tree. 

Theorem 9-5 

Let G be an Euler digraph and Tbe a spanning in-tree in G, rooted at a vertex 
R. Let e t be an edge in G incident out of the vertex R. Then a directed walk 
W =» (e u e 2 , ■ ■ ■ , c m ) is a directed Euler line, if it is constructed as follows: 



Fig. 9-14 Spanning in-tree rooted at R. 
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1. No edge is included in W more than once. 

2. In exiting a vertex the one edge belonging to T is not used until all other 
outgoing edges have been traversed. 

3. The walk is terminated only when a vertex is reached from which there is 
no edge left on which to exit. 

Proof: The walk W must terminate at R, because all vertices must have been 
entered as often as they have been left (because G is balanced). Now suppose 
there is an edge a in C that has not been included in W. Let v be the terminal vertex 
of a. Since G is balanced, v must also be the initial vertex of some edge b not 
included in W. Edge b going out of vertex v must be in T, according to rule 1. 
This omitted edge leads to another omitted edge c in T, and so on. Ultimately, 
we arrive at R, and find an outgoing edge there not included in W. This contradicts 
rule 3. ■ 

The number of distinct Euler lines formed from a given in-tree T, and 
starting with edge e y at R, can be computed by considering all the choices 
available at each vertex, after starting with e y . 

Since there is exactly one outgoing edge in T at each vertex and this edge is 
to be selected last (rule 2), the remaining d + (v i ) — 1 edges at vertex v t can be 
chosen in 


[r/ + (v ( .) — 1]! ways. 

And since these are independent choices, we have altogether 


n 


n [d + (Vt) - 1]! 

I=S 1 


(9-5) 


different Euler lines that meet the three rules in Theorem 9-5. 

Let us apply these three rules to obtain different directed Euler lines in 
Fig. 9-14, from the in-tree {e 2 , e 3 , e 7 , e l0 , e n ], starting with edge e t . We get 
the following two directed Euler lines: 


and 


(e y e l2 c 5 e 6 e 7 e 8 e 9 e i0 e iy e 2 e 4 e 3 ), 
(c, e I2 e 8 e 9 e i0 e iy e s e 6 e 7 e 2 e 4 e 3 ). 


The value of expression (9-5) for Fig. 9-14 is 2. Note that these are not all the 
directed Euler lines in the digraph, but only those that are generated by the 
specific in-tree in accordance with the rules in Theorem 9-5. 

The result in Theorem 9-5 may seem contrived at first sight, but it is a 
very natural step in counting the number of distinct directed Euler lines in a 
digraph, which will be undertaken in Section 9-9. 
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9-7. FUNDAMENTAL CIRCUITS IN DIGRAPHS 

The edges of a connected digraph not included in a specified spanning tree 
T are also called chords with respect to T. Just as in the case of undirected 
graphs, every chord c t when added to the spanning tree T produces a funda¬ 
mental circuit (which may be a directed circuit or a semicircuit). 

A cut-set in a connected digraph G (just as in an undirected graph) induces 
a partitioning of the vertices of G into two disjoint subsets V t and V 2 such 
that the cut-set consists of all those edges that have one end vertex in V x and 
the other in V 2 . All edges in the cut-set may be directed from V x to V 2 , or vice 
versa, or some edges may be directed from F, to V 2 and others from V 2 to V X A 

The concepts of spanning trees, fundamental circuits, and fundamental 
cut-sets are illustrated in Fig. 9-15. A spanning tree is shown in heavy lines. 
Observe that some of the fundamental circuits are directed circuits and 
others are semicircuits. The five fundamental cut-sets, each corresponding to 
an edge in the spanning tree, are also shown. 

Ring Sum of Circuits: Just as in undirected graphs, we can define opera¬ 
tions between subgraphs of a digraph. In particular, the ring sum of two 
subdigraphs g 1 @ g 2 is another subdigraph consisting of edges that are either 
in gj or in g 2 but not in both. 

As in undirected graphs, the ring sum of two circuits (directed or semi¬ 
circuit) in a digraph is either a third circuit or a union of edge-disjoint circuits. 
For if we disregard the directions of edges in the circuits, the earlier results 
from undirected graphs are also applicable to digraphs. Under the ring-sum 
operation, ©, circuits and unions of edge-disjoint circuits form a group. 
Every element of this group can be expressed as a ring sum of some of the 
fundamental circuits with respect to a spanning tree. The same holds for cut¬ 
sets also. 

Set of Directed Circuits Only: The most important property of a set of 
H {= e — «+l) fundamental circuits in a connected graph G (directed or 
undirected) is that these circuits form a basis for all circuits in G. Any circuit 
(directed or semicircuit in a digraph) can be obtained as a ring sum of some 
of these circuits. But in many applications we are interested only in the set of 
directed circuits in a digraph. Is there a similar basis for all directed circuits? 

The answer, unfortunately, is no. The ring sum of two directed circuits is 
not necessarily another directed circuit or edge-disjoint union of directed 
circuits. For example, in Fig. 9-15 the ring sum of directed circuits 
acb(f)adgb = dgc, a semicircuit. In fact, it can be shown that there 
exists no binary operation under which all directed circuits (and edge-disjoint 
unions of directed circuits) form a group, let alone a vector space. 

|A cut-set in which all edges are oriented in the same direction is called a directed cut-set. 
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Rank r = 5 
Nullity n - 4 


Spanning tree T {a,d,J,h,k\ 

Chord-set with respect 
toT- \b,c,e,g\ 


Fundamental circuits 
with respect to T 


Fundamental cut-sets 
with respect to T 




< 



r 


< 


dfe 

(semicircuit) 

d k h c 

(semicircuit) 

k h g 

(semicircuit) 

a d k h b 

(directed circuit) 

a b 


b c d e 


ef 


be g k 


b c gh 



Fig. 9-15 Directed graph and a spanning tree. 


The practical significance of this situation is to be discussed in connection 
with acyclic digraphs, the topic of Section 9-11. 


9-8. MATRICES A, B, AND C OF DIGRAPHS 

The matrices associated with a digraph are quite similar to those discussed 
in Chapter 7 for an undirected graph, with the following basic difference. In 
order to account for the orientation of the edges, the incidence, circuits, and 
cut-set matrices consist of + 1,0, — 1 (instead of only 0 and 1 for undirected 
graphs). The numbers +1, 0, —1 are regarded as ordinary real numbers. 
Their addition and multiplication are interpreted as in ordinary arithmetic 
(and not modulo 2 arithmetic as in the case of undirected graphs). Conse- 
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quently, the vectors and vector spaces associated with a digraph and its 
subdigraphs are over the field of all real numbers, and not GF(2). 

Incidence Matrix: The incidence matrix of a digraph with n vertices, e 
edges, and no self-loops is an n by n matrix A = [a iy ], whose rows correspond 
to vertices and columns correspond to edges, such that 

a u = 1, ifyth edge is incident out of zth vertex, 

= — 1, if jth edge is incident into zth vertex, 

= 0, ifyth edge is not incident on zth vertex. 

A digraph and its incidence matrix are shown in Fig. 9-16. Observe that if we 
disregard the orientations of the edges and correspondingly change — 1 to 1 
in the incidence matrix, Fig. 9-16 becomes identical to Fig. 7-1. 

Observations 1-6 made in Section 7-1 on the properties of the incidence 
matrix of an undirected graph, with minor changes, also hold for digraphs. 

Since the sum (in the real field) of each column is zero, the rank of the 
incidence matrix of a digraph of n vertices is less than n. In fact, we have the 
following theorem, identical to Theorem 7-2, which can be proved along 
similar lines. 

Theorem 9-6 

If A(G) is the incidence matrix of a connected digraph of n vertices, the rank 
of A(G) = n - 1. 


Deleting any one row from A we get A f , the (n — 1) by e reduced incidence 
matrix. The vertex corresponding to the deleted row is called the reference 
vertex. 

Unimodularity of A: It was observed in Chapter 7 that if A is the incidence 
matrix of an undirected graph, the determinant of every square submatrix of 
A is either 0 or 1. This was a result of the fact that the determinant was defined 
in modulo 2 arithmetic and, therefore, could have no other value. 

In the case of digraphs, the incidence matrix A is in the real field, and on 
first sight it would appear that the determinants of its square submatrices 
could acquire any integral value. This, however, is not the case, as shown in 
the following important theorem. 

Theorem 9-7 

The determinant of every square submatrix of A, the incidence matrix of a 
digraph, is 1, —1, or 0. 

Proof: The theorem can be proved directly by expanding the determinant of 
a square submatrix of A. Consider a k by k submatrix M of A. If M has any column 
or row consisting of all zeros, det M is clearly zero. Also det M = 0 if every column 
of M contains the two nonzero entries, a 1 and a —1. 
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Now if det M ^ 0 (i.e., M is nonsingular), then the sum of entries in each 
column of M cannot be zero. Therefore, M must have a column in which there is 
a single nonzero element that is either +1 or —1. Let this single element be in the 
(#,y)th position in M. Thus 


det M = ± 1 • det M, v , 

where M, v is the submatrix of M with its ith row and yth column deleted. The 
(k — 1) by (k — 1) submatrix M u is also nonsingular (because M is nonsingular); 
therefore, it too must have at least one column with a single nonzero entry, say 
in the(p, q)\h position. Expanding det M i7 about this element in the(/>,<7)th position, 

det M ij = ±[determinant of a nonsingular 

(k — 2) by (k — 2) submatrix of M]. 
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Repeated application of this procedure yields 

det M = ±1. 

Hence the theorem. ■ 

Any matrix with every square submatrix having a determinant of 1, — 1, 
or 0 is called a unimodular matrix. (Unimodular matrices also play an impor¬ 
tant role in linear programming.) 

Circuit Matrix of a Digraph: Let G be a digraph with e edges and q 
circuits (directed circuits or semicircuits). An arbitrary orientation (clockwise 
or counterclockwise) is assigned to each of the q circuits. Then a circuit 
matrix B = [b tJ ] of the digraph G is a q by e matrix defined as 

b ij = 1, if /th circuit includesyth edge, and the orientations of the edge 
and circuit coincide, 

= — 1, if /th circuit includes /th edge, but the orientations of the 
two are opposite, 

= 0, if /th circuit does not include the yth edge. 

For example, a circuit matrix of the digraph in Fig. 9-16 is 

a b c d e f g h 

' 0 0 0 1 0 1 1 O' 

001 0-1 010 * 
001 - 1 - 1-100 
1 1 0 0 0 0 0 0 _ 

Note that the orientation assigned to each of the four circuits is entirely 
arbitrary. The circuit in the first row is assigned clockwise orientation, in the 
second row counterclockwise, the third counterclockwise, and the fourth 
clockwise. Changing the orientation of any circuit will simply change the sign 
of every nonzero entry in the corresponding row. Also note that if we subtract 
the first row from the second, we get the third row. Thus the rows are not all 
linearly independent (in the real field, of course). 

Observations 1-7 made in Section 7-3 about the circuit matrix of an 
undirected graph are applicable to the circuit matrix of a digraph also—with 
some obvious minor changes. Just as for undirected graphs, the rows of the 
circuit matrix are orthogonal to the rows of the incidence matrix (this time in 
the real field), as proved in Theorem 9-8. 

Theorem 9-8 

Let B and A be, respectively, the circuit matrix and incidence matrix of a self¬ 
loop-free digraph such that the columns in B and A are arranged using the same 
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order of edges. Then 

A-B r = B* A r = 0, 

where superscript T denotes the transposed matrix. 

Proof: Consider the wth row in B and the kih row in A. If the circuit m does 
not include any edge incident on vertex k, the product of the two rows is clearly 
zero. If, on the other hand, vertex k is in circuit m, there are exactly two edges 
(say x and y) incident on k that are also in circuit m. This situation can occur in 
only four different ways, as shown in Fig. 9-17. (The other four cases with the 
orientation of m reversed are identical to these when x and y are interchanged.) 

The possible entries in row k of A and row m of B in column positions x and 
y are tabulated for each of these four cases. 


Case 

Row k 

Row m 

Dot Product 


Column x 

Column y 

Column x 

Column y 

Row k-Row m 

(a) 

-1 

1 

1 

1 

0 

(b) 

1 

-1 

-1 

-1 

0 

(c) 

-1 

-1 

1 

-1 

0 

(d) 

1 

1 

-1 

1 

0 


In each case, the dot product is zero. Therefore, the theorem. ■ 

Using Sylvester’s theorem (Appendix B) and Theorem 9-8, it can be shown 



(c) (d) 

Fig. 9-17 Vertex k in circuit m. 
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that in a digraph with e edges 

rank of B + rank of A = e. 

Moreover, for a connected graph 

rank of A = n — 1, 

and therefore 

rank of B = e — n + 1. 

The following two important properties of matrices A and B hold for 
digraphs also, and can be proved as was done for undirected graphs in Chapter 
7 (except that here we are working in ordinary real arithmetic and not in 
modulo 2 arithmetic). 

1. The nonsingular submatrices of order n — 1 of A are in one-to-one 
correspondence with the spanning trees of the connected digraph of n 
vertices. 

2. The nonsingular submatrices of B of order p (= e — n + I) are in 
one-to-one correspondence with the chord set (complement of the 
spanning tree) of the connected digraph of n vertices and e edges. 

Sign of a Spanning Tree: For a digraph the determinant of the nonsingular 
submatrix of A corresponding to a spanning tree T can assume either a value 
of + 1 or — 1. This is referred to as the sign of T. 

As we shall see in Chapter 13, in the analysis of a certain class of electrical 
networks it is necessary to know the signs of the spanning trees. Note that the 
sign of a spanning tree is defined only for a particular ordering of vertices and 
edges in A (because interchanging two rows or columns in a matrix changes 
the sign of its determinant). Thus the sign of a spanning tree is relative. Once 
the sign of one spanning tree is arbitrarily chosen, the sign of every other 
spanning tree is determined as positive or negative with respect to this 
spanning tree. 

Number of Spanning Trees: We have Theorem 9-9 for determining the 
number of spanning trees in a connected digraph. (An identical result for 
undirected graphs was given in Problem 7-20.) 

Theorem 9-9 

Let A f be the reduced incidence matrix of a connected digraph. Then the 
number of spanning trees in the graph equals the value of 


detfAjr-Ap. 
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Proof: According to the Binet-Cauchy theorem (Appendix A) 

det(A / -Ap = sum of the products of all corresponding 
majors of A/ and A T f . 

Every major of A f or A T f is zero unless it corresponds to a spanning tree, in which 
case its value is 1 or —1. Since both majors of A f and A T f have the same value 
+ 1 or — 1, the product is +1 for each spanning tree. ■ 

Fundamental Circuit Matrix: The /j, fundamental circuits each made by a 
chord (with respect to some specified spanning tree) define a fundamental 
circuit matrix B f for a digraph. The orientation assigned to each of the 
fundamental circuits is chosen to coincide with that of the chord. Therefore, 
B f , a n by e matrix, can be expressed exactly in the same form as in the case 
of an undirected graph in Section 7-4: 

B/ = [1^ I B,], 

where \ M is the identity matrix of order /x, and the columns of B, correspond to 
the edges in a spanning tree. This is illustrated in Fig. 9-18. 



Fig. 9-18 Digraph and its fundamental circuit matrix. 
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The cut-set matrix C of a digraph G is also similarly defined. And so is its 
submatrix C f , the fundamental cut-set matrix with respect to a given spanning 
tree in G. 


9-9. ADJACENCY MATRIX OF A DIGRAPH 


Another important matrix used in the representation and study of 
digraphs is the adjacency matrix defined as follows: Let G be a digraph with 
n vertices, containing no parallel edges. Then the adjacency matrix X = [x.J 
of the digraph G is an n by n (0, l)-matrix whose element 

x u — 1 , if there is an edge directed from ith vertex to /th vertex, 

= 0, otherwise. 


A digraph and its adjacency matrix are shown in Fig. 9-19. 

The adjacency matrix occurs in many different disciplines, and therefore 
has different names. In the theory of sequential machines it is called the 
transition matrix. In the calculus of relations it is called the relation matrix. 
(Observe that the relation matrix defined in Section 9-3 is the same as the 
adjacency matrix of the corresponding digraph.) In network flows it is called 
the connection matrix. It is also known as the precedence matrix or preference 
matrix in some sociological applications. In scheduling and critical-path 
analysis the adjacency matrix is known as the predecessor matrix. 

Let us make the following observations on the properties of the adjacency 
matrix X of a digraph G. 


1. X is a symmetric matrix if and only if G is a symmetric digraph. 

2. Every nonzero element on the main diagonal represents a self-loop at 
the corresponding vertex. 



Fig. 9-19 Digraph and its adjacency matrix 
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3. There is no way of showing parallel edges in X. This is why the 
adjacency matrix is defined only for a digraph without parallel edges. 

4. The sum of each row equals the out-degree of the corresponding vertex, 
and the sum of each column equals the in-degree of the corresponding 
vertex. The number of nonzero entries in X equals the number of 
edges in G. 

5. Permutation of any two rows accompanied by a permutation of the 
corresponding columns does not alter the graph. The permutation 
merely corresponds to a reordering of the vertices. Thus two digraphs 
are isomorphic if and only if their adjacency matrices differ only by 
such permutations. 

6. If X is the adjacency matrix of a digraph G, then the transposed matrix 
X r is the adjacency matrix of a digraph G R obtained by reversing the 
direction of every edge in G. 

7. For any square (0, l)-matrix Q of order n, there exists a unique digraph 
G of n vertices, such that Q is the adjacency matrix of G. 


The adjacency matrix is used as a tool to investigate the properties of a 
digraph, specially by means of a digital computer. For example, the con¬ 
nectedness of a digraph is reflected in its adjacency matrix in the following 
fashion. 

Connectedness and the Adjacency Matrix: A digraph is disconnected if and 
only if its vertices can be ordered in such a way that its adjacency matrix X 
can be expressed as the direct sum of two square submatrices X, and X 2 as 
follows: 


rx, 

o ! 

i — 

o 

X 2 J 


(9-6) 


Such a partitioning is possible if and only if the vertices in the submatrix Xj 
have no edge going to or coming from the vertex set in X 2 . 

Similarly, a digraph is weakly connected if and only if its vertices can be 
ordered in such a way that its adjacency matrix X can be expressed in the 
form (9-7) or (9-8): 


rx, 

i 0 

l 

Lx" 

! X 2 

rx, i 

x 12 

_0 ! 

X 2 


(9-7) 

(9-8) 


where X t and X 2 are square submatrices. Form (9-7) represents the case when 
there is no directed edge going from the subdigraph corresponding to X, to the 
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one corresponding to X 2 . Form (9-8) represents the case when there is no 
directed edge going to the subdigraph corresponding to X,. 

A digraph that is neither unconnected nor weakly connected is strongly 
connected. Therefore, we conclude that a digraph G is strongly connected if 
and only if the vertices of G cannot be ordered such that its adjancency matrix 
X is expressible in the form (9-6), (9-7), or (9-8). 

Number of Edge Sequences: We have the following result, similar to 
Theorem 7-8, concerning the powers of the adjacency matrix X of a digraph G. 

Theorem 9-10 

The (i',/)th eniry in X r equals the number of different, directed edge sequences 
of r edges from the /th vertex to theyth. 

Proof: (By induction) The theorem is trivially true for r — 1 . As the inductive 
hypothesis, assume that the theorem holds for X r_1 . The (/, y)th entry in 
X r (= X r ~ 1 • X) 

= 2 ^) th entr y * n X r_1 ]-jr fcy 

k — \ 

n (9-9) 

= 2 (number of all directed edge sequences v 

k= 1 

of length r — 1 from vertex i to k)-x kj , 

according to the induction hypothesis. In (9-9), x kj = 1 or 0, depending on whether 
or not there is a directed edge from k to j. Thus a term in the sum (9-9) is nonzero 
if and only if there is a directed edge sequence of length r from i to j, whose last 
edge is from k to j. If the term is nonzero, its value equals the number of such 
edge sequences from i to j via k. This holds for every vertex k, 1 < k < «. There¬ 
fore, (9-9) is equal to the number of all possible directed edge sequences from 
1 toy. ■ 

As in the case of Theorem 7-7, it must be kept in mind that the (/,y')th 
entry in X r gives the number of all directed edge sequences from vertex i to j. 
These edge sequences fall in three different categories: 

1. Directed paths from / to j: Those directed edge sequences in which no 
vertex is traversed more than once. 

2. Directed walks from / to j: Those directed edge sequences in which a 
vertex may be traversed more than once, but no edge is traversed more 
than once. 

3. Those directed edge sequences in which an edge may also be traversed 
more than once. 

Unfortunately, there is no easy way of separating these, say, category 1 
from 2 and 3. This is why this simple method cannot be employed for 
enumerating directed paths or directed circuits of a specified length. 
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For example, examine the fourth power of the adjacency matrix of the 
digraph in Fig. 9-19: 


X 4 = 


1 

1 

1 

0 

2 


1 o o r 

2 2 11 
2 2 10 
0 0 0 0 
2 112 


The entry in the second row and third column represents two directed edge 
sequences of length four: c b gf( a directed walk from 2 to 3) and d d cb (not 
a walk). The third diagonal entry represents two directed edge sequences of 
length four beginning and ending at vertex 3: g e c b (a directed circuit) and 
gfgf (not a directed circuit). The reader should also examine the remaining 
entries in X 4 . 

Number of Arborescences: A method of counting the number of spanning 
trees in a labeled, undirected graph was suggested in Problem 7-23. There is a 
similar formula for counting the number of spanning arborescences in a 
labeled, connected, simple digraph. (Counting of the spanning arborescences 
in any connected digraph is a trivial extension of counting them in a simple, 
connected digraph. The self-loops can be discarded right away, and addition 
of a parallel edge b to an existing edge a simply doubles the number of the 
arborescences—repeating the same arborescences with a replaced by b .) 

In preparation for the arborescence counting formula, let us define, for a 
simple digraph G of n vertices, an n by n matrix called the Kirchhojf matrix 
K(<7) or K = 

k u = d~(vf in-degree of the z'th vertex, 

k ij = —x ip (i,j) th entry in the adjacency matrix, with a negative sign. 

For example, a digraph and its K matrix are shown in Fig. 9-20. 

The sum of the entries in each column in a K matrix is equal to zero, which 
means that the n rows are linearly dependent; therefore, 

det K = 0. 


Next we explore the special property of the K matrix of an arborescence. 
Theorem 9-11 

A simple digraph G of n vertices and n — 1 directed edges is an arborescence 
rooted at v x if and only if the (1, 1) cofactor of K(<7) is equal to 1. 

Proof: (a) Let G be an arborescence with n vertices and rooted at vertex v x . 
Relabel the vertices as v x , v 2 , such that vertices along every directed 
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path from the root v x have increasing indices. Permute the rows and columns of 
K(G) to conform with this relabeling. 

Since the in-degree of v x equals zero, the first column contains only zeros. 
Other entries in K((7) are 

k tj =0, i > j, 

kij = -x ijt i <j, 

k lt = 1, *> 1. 

Then the K matrix of an arborescence rooted at v x is of the form 

”0 -Xj2 -X 13 - X 14 

0 1 -x 23 -x 24 

0 0 1 —x 34 

K ( G > ^ 0 0 0 1 


[_0 0 0 0 

Clearly, the cofactor of the (1, 1) entry is 1; that is, det Kj x =1. 

(b) Conversely, let G be a simple digraph of n vertices and n — 1 edges, and 
let the (1, 1) cofactor of its K matrix be equal to 1; that is, det K n =1. 

Since detKjj ^0, every column in K n has at least one nonzero entry. 
Therefore, 

d~(vi) > 1, for i = 2, 3,. . . , n. 

There are only n — 1 edges to go around. Therefore, 



and 


d~(vi) = 1, for / — 2, 3,. . . , n, 
d~(v x ) = 0. 


Now since no vertex in G has an in-degree of more than one, if G can have any 
circuit at all, it has to be a directed circuit. Suppose that such a directed circuit 
exists, which passes through vertices v h , v it , . . ., v ir . Then the sum of the columns 
i 2 , . . . , / r in Kj i is zero. (This is because each of these columns contains exactly 
two nonzero entries, a 1 on the main diagonal, and a — 1 for the incoming edge 
from the vertex preceding it in the directed circuit.) Thus these r columns in K] x 
are linearly dependent. Hence det K n = 0, a contradiction. Therefore, G has no 
circuits. 

If G has n — 1 edges and no circuits, it must be a tree. Since in this tree 


and 


d (v x ) = 0, 

d~(vi ) = 1, for / = 2, 3, . . . , n. 


G must be an arborescence rooted at vertex v y . 
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The arguments in (a) and (b) are valid for an arborescence rooted at any vertex 
v q . Any reordering of the vertices in G corresponds to identical permutations of 
rows and columns in K(G). Such permutations do not alter the value or sign of 
the determinant. ■ 

Next we come to an important result, which was first discovered by R. 
Bott and J. P. Mayberry and was proved by W. T. Tutte. 

Theorem 9-12 

Let K (G) be the Kirchhoff matrix of a simple digraph G. Then the value of 
the (q, q) cofactor of K(G) is equal to the number of arborescences in G rooted 
at the vertex v q . 

Proof: The proof depends on the result of Theorem 9-11 and on the fact that 
the determinant of a square matrix is a linear function of its columns. Specifically, 
if P is a square matrix consisting of n column vectors, each of dimension n, that is, 


then 


P =[Pl,P2, ■ ■ ■, (Pi + Pi), ■ ■ -,Pn\ 

det P = det[ p u Pi ,. . ., p t , • • • , P n ] 

+ det[/7,,/? 2 ,,/>„]. 


(9-10) 


In graph G suppose that vertex vj has in-degree of dj. The y'th column of K(G) 
can be regarded as the sum of d } different columns, each corresponding to a graph 
in which v } has in-degree one. And then (9-10) can be repeatedly applied. After 
this, splitting of columns can be carried out for each j, j ^ q, and det K qq (G) can 
be expressed as a sum of determinants of subgraphs; that is, 

det MG) = 2 detK„(*), (9-11) 

g 

where g is a subgraph of G, with the following properties: 

1. Every vertex in g has an in-degree of exactly one, except v q . 

2. g has n — 1 vertices, and hence n — 1 edges. 

From Theorem 9-11, 


det K w (^) = 1, if and only if g is an arborescence rooted at q, 
= 0, otherwise. 


Thus the summation in (9-11) carried over all g 's equals the number of arbo¬ 
rescences rooted at v q . ■ 

Theorem 9-12 is illustrated in Fig. 9-20. The cofactor of every entry in the 
second row of the K matrix is 3. The digraph does indeed have three 
arborescences rooted at vertex 2. 

For an Euler digraph G, all cofactors of K (G) are equal, because the sum 
of each row and the sum of each column equals zero. Let this common value 
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Fig. 9-20 Digraph G, its K matrix, and all arborescences rooted at 2. 


of all cofactors of K(G) be a. This a is the number of different arborescences 
rooted at any given vertex in G. The number of different Euler lines associated 
with each of these distinct arborescences is given by Eq. (9-5). Therefore, 
Theorem 9-13 is obtained. 


Theorem 9-13 

In an Euler digraph the number of Euler lines is 

o • II ~ !]!• 

f= i 

From this theorem we can compute the number of Euler lines in any 
connected balanced digraph. As an example, let us compute the number of 
Euler lines in Fig. 9-10. Its K matrix is 

“1-1 0 0 0 0 0 0“ 

0 2 -10 -10 0 0 
0 0 2 -1 0 0 0 -1 

0 0 -1 2 -1 0 0 0 

0 0 0 0 2 -1 -1 0 

0 0 0 0 0 1-10 
0 0 0 -1 0 0 2 -1 

-1 -1 0 0 0 0 0 2 




sec. 9-10 


PAIRED COMPARISONS AND TOURNAMENTS 


227 


(In this matrix, vertices appear in the order as they do in the directed Hamil¬ 
tonian path e 2 e 3 e 4 e u e i2 e, 4 e , 5 .) 

The cofactor of any term in this matrix is 16, and therefore a = 16 in 
Theorem 9-13. Since d~(v ,.) = 2 for each v t in Fig. 9-10, 


hld-iV;)- 1 ]! = 1 . 

/ = 1 

Therefore, the number of Euler lines in Fig. 9-10 is 16. 

However, for a regular Euler digraph, such as the one in Fig. 9-10, it is 
often easier to compute the number of Euler lines by other methods 
(Problem 9-18). 

9-10. PAIRED COMPARISONS AND TOURNAMENTS 

In many experiments, specially in the social sciences, one is required to 
rank a number of given objects by comparing only two at a time. This is called 
the method of paired comparisons, and is used in situations where a numerical 
measurement is difficult, for example, individual preference for pieces of 
music. The items are presented two at a time to a subject and he is asked to 
state his preference. After having noted the results of all possible n(n — l)/2 
paired comparisons of the n objects, the experimenter ranks the n objects in 
order of preference. 

A digraph is a natural way of representing the results of a paired- 
comparison experiment. The results of a classic experiment of Kendall [9-5] 
are shown in Fig. 9-21. Six different dog foods [ 1,2, . . . , 6] were to be ranked. 
Each day two of the six delicacies were served to a dog, and the dog estab¬ 
lished preference for one food over the other according to which plate he 
finished first. The experiment was conducted for 15 days, so that all possible 
pairs could be tried. In the graph representation, an edge is drawn from the 
preferred dish to the less preferred. For example, 1 was preferred to 2 in Fig. 
9-21. Such a graph is called a preference graph. 

Establishing a rank from a given preference graph is, in general, not easy. 
In Fig. 9-21, for example, due to some canine inconsistency, the dog preferred 
food 1 over 2, 2 over 4, and then 4 over 1. So which of the three is the best? 



5 


4 


Fig. 9-21 Results of a paired-compari¬ 
son experiment. 
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On Tournaments: A similar situation is encountered in tournaments. The 
results of a round-robin tournament in which every player has played against 
every other may also be represented by a digraph in which an edge directed 
from vertex a to b represents the victory of player a over player b. This is why 
a complete asymmetric digraph was called a tournament or a complete 
tournament in Section 9-2. The digraph in Fig. 9-21 can also be viewed as the 
result of a six-player tournament. The problem of ranking players in a 
tournament is identical to that of ranking in a paired-comparison experiment. 

Ranking by Score: A straightforward method of ranking, and the one that 
has been traditionally used in round-robin tournaments, is to rank each 
player by his score. The score is the number of games the player has won. In 
terms of the dog food, the number of times the particular dish was preferred is 
its score. The score of a player in a tournament equals the out-degree of the 
corresponding vertex in the digraph. 

Thus if we use the scores for ranking, we would rank the six dog foods as 

(1,3), (2,5,6), and 4. 

That is, foods 1 and 3 are tied for the first rank; there is a three-way tie for the 
second rank; and food 4 is the least preferred. 

Ranking the vertices according to their out-degrees is not always a 
satisfactory method, although it is the easiest. In particular, this method loses 
significance if the tournament is incomplete (that is, the players do not com¬ 
pete in the same number of games). 

Ranking by Hamiltonian Path: Another method sometimes used is to rank 
the players in a directed Hamiltonian path, such that each player has defeated 
his successor. One such ranking in Fig. 9-21 is 1 3 2 5 6 4. In this context, let 
us prove the following result regarding Hamiltonian paths in a tournament. 

Theorem 9-14 

Every complete tournament has a directed Hamiltonian path. 

Proof: The theorem will be proved by induction on the number of vertices. 
By actual sketching, the theorem can be shown to hold for all complete tourna¬ 
ments of 1,2, 3, and 4 vertices. Let us make the inductive assumption that the 
theorem is true for all complete tournaments of n vertices, and then prove that it 
also holds for all tournaments of n + 1 vertices. 

Let G be any complete tournament of n + 1 vertices. Let g be an ^-vertex 
complete subtournament of G. By inductive assumption, g has a directed Hamil¬ 
tonian path. Let that path be v t v 2 ■ . . v„. Let the vertex present in G but not in 
g be called v n+1 . 

Since G is a complete tournament of n + 1 vertices, the vertex v„ +1 in G has 
a directed edge either to or from each of the other vertices v u v 2 , ■ . . , v n . The 
following three cases are possible. 

Case 1: The edge between v„ +l and v t is directed toward v t . Then we have 



sec. 9-10 


PAIRED COMPARISONS AND TOURNAMENTS 


229 


u «+l 




Case 1 Case 2 



Case 3 

Fig. 9-22 Three cases of Theorem 9-14. 

a Hamiltonian path v„ +t v x v 2 ...v n in G , and the proof is complete [Fig. 9-22(a)]. 

Case 2: There is an edge directed from v„ to v n+l . Then also we have a Hamil¬ 
tonian path in G, which is v x v 2 > . . v„ v n+l , and the proof is complete [Fig. 9-22(b)]. 

Case 3: Instead, both these edges are directed from v { to v n+1 and from v n+l 
to v n . In this case, as we move from v x to v„, we encounter a reversal of direction 
in the edges incident on v n+l . This reversal must occur because edge (v 1 ,w n+1 ) 
is directed toward v n+i , but edge ( v n , v n+l ) is directed away from v n+x . Call the 
vertex at which the first such reversal occurs v t ( v ,• may be v„ itself). Then edge 
(w,_,, v n+i ) must be directed toward v„ +l . See Fig. 9-22(c). In this case we have a 
directed Hamiltonian path v, v 2 . .. v n+t v t -v i+ , . .. v„ in G. Therefore, the 
theorem. ■ 

Coming back to the original problem of ranking the vertices, we now 
know that if the digraph is a complete tournament, at least one Hamiltonian 
ranking is always possible. 

However, this method of ranking also suffers from some drawbacks. For 
one, there may be discrepancies between such a ranking and the scores of the 
players. Second, a tournament may have more than one directed Hamiltonian 
path, and therefore several different rankings are possible. In Fig. 9-21, for 
instance, 1 3 2 5 6 4 and 1 3 5 6 2 4 are two different Hamiltonian rankings. 

Ranking with Minimum Violations: For a given ranking of the n vertices in 
any tournament (complete or incomplete), a violation is defined as an edge 
directed from v. to Vj if precedes v. in the ranking. For example, in Fig. 
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9-21 the order 1 3 2 5 6 4 has the following two violations—edges 4 to 1 and 
6 to 2. The order 3 2 5 6 4 1 has five violations, edges 1 to 3, 1 to 2, 6 to 2, 
1 to 5, and 1 to 6. 

Ranking with the minimum number of violations represents the fewest 
possible upsets for a given tournament. It can be shown that a ranking with 
minimum violations automatically includes the ranking according to scores, 
as well as a Hamiltonian ranking. Moreover, a minimum-violation ranking is 
also meaningful for incomplete tournaments. Thus this may be considered 
the best method of ranking. 

However, out of all n\ possible orders of n vertices, to find one with 
minimum violations is computationally difficult. A method using dynamic 
programming has been used and is the best available so far, but it is com¬ 
putationally slow and cumbersome. 

A minimum number of violations among all n\ rankings represents a 
smallest set of edges whose removal from the digraph will eliminate all 
directed circuits, that is, make the digraph acyclic. Acyclic digraphs are 
discussed in the next section. 

9-11. ACYCLIC DIGRAPHS AND DECYCLIZATION 

In many situations semicircuits are of no significance, and one is con¬ 
cerned only with whether or not a given digraph has a directed circuit. A 
digraph that has no directed circuit is called acyclic. Let us make the fol¬ 
lowing observations about acyclic digraphs: 

1. Every tree (with directed edges) is an acyclic digraph, but the converse 
is not true. For example, the digraph in Fig. 9-4 is acyclic, but it is not a 
tree. 

2. An acyclic digraph cannot be condensed. That is, the condensation G c 
of an acyclic digraph G is G itself. The converse is also true, because if 
G c = G, obviously G has no directed circuit. 

3. An acyclic digraph represents an irreflexive, asymmetric relation. But 
the digraph of an irreflexive, asymmetric relation is not necessarily 
acyclic. (Why?) 

4. A digraph G is acyclic if and only if every directed walk in G is also a 
directed path. 

5. Observation 4 has a significant implication: If a digraph is acyclic, the 
(/, /)th entry in X k gives the number of distinct directed paths of length 
k from the zth vertex to the y'th vertex. 

Theorem 9-15 

Every acyclic digraph G has at least one vertex with zero in-degree and at least 
one vertex with zero out-degree. 



SEC. 9-11 


ACYCLIC DIGRAPHS AND DECYCLIZATION 


231 


Proof: Consider any maximal directed path (i.e., a path whose length cannot 
be increased by an edge at either end) P in G. Let v be the vertex where P starts 
and w be the vertex where it ends. Since G is acyclic, v and w must be distinct. 
Now the vertices in G can be divided into two classes: The set V x of vertices that 
are on P, and the set V 2 of the remaining vertices. 

There is no edge incident into vertex v from any vertex in V x . Otherwise, G 
would have a directed circuit. Also, there can be no edge incident into v from any 
vertex in V 2 ‘, otherwise, the length of P could have been increased by adding this 
edge. Thus the in-degree of v, d~(v) = 0. Similarly, vertex w has no edge incident 
out of it; that is, d + (w ) =0. | 

Theorem 9-16 

A digraph G is acyclic if and only if its vertices can be ordered such that the 
adjacency matrix X is an upper (or lower) triangular matrix. 

Proof: (a) Let us assume that X is upper triangular; that is, 

X;j =0 for / > j. 

It can be.seen by direct multiplication that X 2 is also upper triangular, and so are 
X 3 , X 4 ,.. ., all powers of X. Since every diagonal entry in all powers of X is zero, 
G has no directed circuit. That is, G is acyclic. 

(b) For the second part of the theorem, assume that G is acyclic and then 
reorder the vertices of G, as follows: According to Theorem 9-15, there is at least 
one vertex in G whose in-degree is zero. In the reordering of the vertices let this 
be the first vertex v x . Now, remove v x and edges incident on v x from G. The 
remaining digraph G — v x must also be acyclic, because G was acyclic. Therefore, 
G — v x has also at least one vertex, whose in-degree in G — v x is zero. Let v 2 be 
this second vertex in the reordering. Next remove v 2 from G — v x . By continuing 
this process all the vertices are reorderedt as v x , v 2 ,. . . . 

Now consider the adjacency matrix X of G with the vertices appearing in this 
order. The first column (corresponding to v x ) has all zeros. The second column 
below the first row represents vertex v 2 in G — v x , and therefore contains all zeros. 
And so forth. 

Thus the adjacency matrix is upper triangular. This proves part (b) of the 
theorem. 

The lower triangular portion of the theorem can also be proved either by 
reordering of the vertices with zero out-degrees, or by considering X r and G R . ■ 

Given the adjacency matrix X of a digraph G, the following result is quite 
useful in finding out whether or not G is acyclic. 

Theorem 9-17 

Digraph G is acyclic if and only if det(l — X) is not equal to zero, where I is 
the identity matrix of the same size as X. 

fThis is called a topological sorting. See Section 14-8 also. 



232 


DIRECTED GRAPHS 


CHAP. 9 


Proof: Det(l — X) ^ 0 if and only if the inverse (I — X) -1 exists. But 

(I - x)" 1 — I + X + X 2 + ••• + X” + •••• (9-13)t 

This inverse (I — X) -1 exists if and only if the infinite series (9-13) converges; that is, 
X* = 0 for all k > some N (because X contains only nonnegative entries). 

However, X* = 0 for all k> N if and only if G contains no directed edge 
sequence of length N or larger. And this is possible if and only if G contains no 
cycle of any length. ■ 

Decyclization: Acyclic digraphs are of enormous importance in many 
applications. It was pointed out that directed circuits represent inconsis¬ 
tencies in ranking by paired comparisons. Directed circuits may represent 
undesirable feedback paths in an electrical network. In the project graph of a 
CPM (critical path method) or PERT (program evaluation and review 
technique) a directed circuit represents a serious error, and must be elimi¬ 
nated. This is because a directed circuit, say abca, implies that activity a must 
be completed before activity b, and b before c, and c before a. Obviously, this 
is an impossible situation and nothing will get done. A similar situation in 
computer programming often arises and is justifiably known as the deadly 
encounter or the deadly embrace (Problem 9-26). 

In deductive logic (where vertices represent axioms or statements and 
directed edges represent the theorems or derivation of one statement from 
others), a directed circuit implies circular reasoning and hence a fallacy. 

Thus it is important to know how to break these vicious cycles with a 
minimum of effort. In other words, find a smallest set of directed edges whose 
removal will render the given digraph G acyclic. 

Consider, for example, Fig. 9-2(a). The digraph contains several directed 
circuits: e t e 2 e 3 , e^e 6 e 3 , and e A e 5 e 2 e 3 . In this simple case, one can tell by 
inspection that the removal of edge e 3 will eliminate all directed circuits. This 
is the smallest set of edges whose removal makes the remaining digraph 
acyclic. Such a smallest set of edges whose removal destroys all directed 
circuit in a digraph G is known as a minimum-feedback arc set in electrical 
engineering. In general, a digraph may possess several minimum-feedback 
arc sets. Obtaining one such smallest set of edges may be called minimal 
decyclization of a digraph. 

Minimal decyclization of an arbitrary directed graph is at best a tedious 
affair. No simple method has been found so far. One method proposed in the 
literature (in 1969) uses Theorem 9-16, as follows: Make the adjacency matrix 
X upper triangular as much as possible by interchanging rows (and corre¬ 
sponding columns). The l’s remaining below (and on) the principal diagonal 

tThis identity can be seen by premultiplying both sides of (9-13) with the matrix 
(1 - X). 
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represent a minimum-feedback arc set. Another method can be 

1. Obtain all directed circuits in the given digraph G (using the result of 
Problem 9-11, say). 

2. Express each directed circuit as a Boolean sum of its edges. 

3. Take the Boolean product of all directed circuit expressions obtained 
in step 2. (The absorption laws of Boolean algebra are applied, such as 
a>a = a, a a ~ a, and a + ab — a.) 

4. Each of the resulting terms in the sum of the products represents a set 
of edges whose removal will destroy all directed circuits. Pick a term 
that consists of the smallest number of edges; this is a minimum- 
feedback arc set. 

To illustrate the procedure, let us consider the digraph in Fig. 9-2(a). All 
the directed circuits are 

e x e 2 e 3 , e 3 e 4 e 6 , and e 2 e 3 e 4 e 5 . 

Expressing these as a product of Boolean sums and multiplying out and 
simplifying, we get 

(e x + e 2 + e 3 ){e 3 + e 4 + e 6 )(e 2 -f e 3 + e 4 + e 5 ) 

= e x e 4 + e x e 6 e 3 + e 2 e 4 + e 2 e 6 + e 3 . 

Clearly, any one of these terms represents the set of edges whose removal 
would break all directed circuits in Fig. 9-2(a). The set with the smallest 
number of edges, {e 3 }, is the answer we were seeking. Both these methods are 
impractical for large digraphs. 


SUMMARY 

Most of the important and fundamental features of directed graphs were 
investigated in this chapter. We saw that there are two different aspects of 
digraphs: one in which their properties are similar to those of undirected 
graphs, such as planarity, thickness, spanning trees, fundamental circuits, and 
cut-sets; in their second aspect, digraphs have properties altogether different 
from those of undirected graphs, such as strong connectedness, arborescence, 
decyclization, and so on. 

The close relationship between binary relations and digraphs was explored. 
Applications of digraphs are virtually unlimited. Some important ones, such 
as in sequence generation in telecommunications and paired comparisons, 
were dealt with in detail. Others were simply mentioned. 
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Undoubtedly, a great deal more remains to be said. Additional properties 
of digraphs are presented in the form of problems at the end of this chapter. 
For the rest the reader must explore on his own, using the tools and results 
presented in the chapter. 
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PROBLEMS 

9-1. Prove that in any digraph the sum of the in-degrees of all vertices is equal to the 
sum of their out-degrees; and this sum is equal to the number of edges in the digraph. 
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9-2. Sketch all different (nonisomorphic) simple digraphs with 1, 2, and 3 vertices. 

9-3. Sketch all distinct (nonisomorphic) orientations of a complete graph of four 
vertices. Characterize each of the resulting digraphs in terms of binary relations. 
9-4. An irreflexive, asymmetric, transitive relation on a set is called a strict partial order. 
Give two examples of strict partial orders. Show that the digraph of a strict partial 
order is acyclic. Is the converse also true? 

9-5. The combinations of reflexivity, symmetry, and transitivity define eight (2 3 = 8) 
types of binary relations. Two such relations are equivalence and partial order. 
List the other six and sketch a digraph for each. 

9-6. Define and study the directed Hamiltonian circuit and semi-Hamiltonian circuit 
in a digraph. 

9-7. Prove that every edge in a digraph belongs either to a directed circuit or a directed 
cut-set. 

9-8. For an /z-vertex digraph, define an n by n accessibility (or reachability) matrix 
R = [rij\ as follows: 

r t j = 1, if there is a directed path of length one 
or more from i to j, 

= 0, otherwise. 

Devise a.method of obtaining R from the powers of the adjacency matrix X. (Note 
that this reachability matrix is slightly different from that in [9-3], because we do 
not include paths of zero length; i.e., r a is not necessarily one.) 

9-9. Is it possible for two nonisomorphic digraphs to have the same reachability 
matrix R ? Explain. 

9-10. Show that if R is the reachability matrix of a digraph G, the value of the /th entry 
in the principal diagonal of R 2 gives the number of vertices included in the strongly 
connected fragment containing the /th vertex. 

9-11. Show that the following procedure applied to the adjacency matrix X — [x/y] of a 
digraph G will yield the reachability matrix R of G. 

Step I: Let xn, x\j ,..., x\ m be the nonzero elements in the first row. Add the 
/th, yth,..., mth rows to the first row. Replace each nonzero element by a 1 
(Boolean sum). 

Step 2: Suppose that there are k additional nonzero elements p.q ,... , r generated 
in the first row as a result of step 1. Add the pth, q th,... , rth rows to the first row, 
and replace each nonzero element by a 1. 

Step 3: Repeat step 2 until no additional l’s can be added to the first row by this 
process. 

Step 4: Repeat the process on every row of X. 

9-12. Prove that an zz-vertex digraph is strongly connected if and only if the matrix M, 
defined by 

M = X -f X 2 + X 3 + • • • + X«, 
has no zero entry. X is the adjacency matrix. 

9-13. Prove that every Euler digraph (without isolated vertices) is strongly connected. 

Also show, by constructing a counterexample, that the converse is not true. 

9-14. List all 16 distinct directed Euler lines in Fig. 9-10. 

9-15. The Euler digraph in Fig. 9-10 is called the teleprinter diagram or the Good diagram 
for r = 4 [abbreviated as GD(4)]. Sketch and label GD(3) and GD(5). Find one 
directed Euler line and one directed Hamiltonian circuit in each. [Hint: GD(r) 
has 2 r_1 vertices and 2 r edges. A vertex in GD(r +1) corresponds to an edge 
in GD(r).] 
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9-16. An edge digraph or a line digraph L(G ) of a digraph G is defined as follows: 

1. There is exactly one vertex in L(G) for every edge e t in G. 

2. Whenever edges e t and ej (for a self-loop ej = e t ) are such that e ( is incident 
into a vertex v and ej is incident out of the same vertex v, an edge is drawn 
from the corresponding vi to vj in L(G). 

Show that GD(r + 1) is a line digraph of GD(r). 

9-17. If E\G\ is the number of Euler lines in an n-vertex Euler digraph G, show that 
2 n_ 1 • E | G | is the number of Euler lines in L(G). 

9-18. Prove that the number of directed Euler lines in GD(r) is 

22 T -'~r 

(Hint: Use the results of Problem 9-16 or use Theorem 9-13.) 

9-19. A drum rotates in discrete steps of 9 degrees, and you are to determine its precise 
position as follows. Divide the surface of the drum into k = 360 o /6 sectors, and 
paint each sector black or white (or conducting or nonconducting). Mount r con¬ 
secutive reading heads—each capable of detecting the color of the sector. 

Given some 6, express k and r in terms of 6. Sketch one such arrangement of 
colors on the drum for k — 16. 

9-20. What is the longest circular sequence formed out of three symbols (letters) x,y, 
and z such that no subsequence (words) of four symbols is repeated. Give one 
such sequence. [Hint: Form a regular Euler digraph with d~(v,) = d + (v t ) = 3, in 
the manner of Fig. 9-10.] 

9-21. Prove that any acyclic digraph G is an arborescence if and only if there is a vertex 
v in G such that every vertex is accessible from v. 

9-22. Prove that for every n > 3 there exists at least one acyclic complete tournament of 
n vertices. (Hint: Use induction.) 

9-23. Let R(G) be the reachability matrix of a digraph G, and let the vertices in G be 
ordered such that the sums of the rows in R(G) are nonincreasing; that is, 

2 nj > 2 r k j for every i < k. 
i =i j=i 

Show with this ordering of vertices in R(G) that digraph G is acyclic if and only 
if R(G) is an upper triangular matrix. 

9-24. Prove that a digraph G is acyclic if and only if every element on the principal 
diagonal of its reachability (or accessibility) matrix R(G) is zero. 

9-25. Prove that an acyclic digraph G of n vertices has a unique directed Hamiltonian 
path if and only if the number of nonzero elements in R((7) is n(n — l)/2. 

9-26. There are 15 computer programs that must be processed according to the following 
set of orders: 

1 > 2, 7, 13, 

2 > 3, 8, 14, 

3 > 9, 15, 

4 > 3, 

5 > 4, 11, 

6 > 5, 12, 

7 > 6, 
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8 > 7,9, 14, 

9 > 15, 

10 > 4,9, 

11 > 10 , 

12 > 11 , 

13 > 7, 12, 

14 > 13, 15, 

where 1 > 2, 7, 13 means that programs 2, 7, and 13 can be processed only after 
program 1 has been processed. Is it possible for the programs to be processed? 
If so, give a processing sequence. [Hint: Write X(G); derive R((7) from X(G) using 
Problem 9-11. Use Problem 9-23 to check if G is acyclic.] 

9-27. A digraph defined on the relation “is a parent of” is called a genetic digraph. 
(Genetic digraphs are us'eful in biology.) Investigate the properties of genetic 
digraphs. 

9-28. Use digraphs to solve the classical problem of “three cannibals and three edible 
missionaries seeking to cross a river in a boat that can hold at most two people, 
and all the missionaries and one of the cannibals can row the boat. Also, at no 
time should the cannibals outnumber the missionaries on either shore.” {Hint: 
Represent each state by a vertex and a possible transition by a directed edge.) 



■ ENUMERATION OF GRAPHS 


Arthur Cayley (1857), one of the founding fathers of graph theory, became 
interested in graph theory for the purpose of counting trees. The number of 
different trees of n vertices gave him the number of isomers of the saturated 
hydrocarbon with n carbon atoms, that is, C n H 2w+2 . Since Cayley’s classic 
paper, a great deal of work has been done on counting (also called enumera¬ 
tion) of different types of graphs, and the results have been applied in solving 
some practical problems. 

Some enumeration problems have already been introduced in earlier 
chapters. For example, in Chapter 2 the number of edge-disjoint Hamiltonian 
circuits in the complete graph of n vertices was discussed. Enumeration of 
trees in Section 3-6; finding all spanning trees in Section 3-9; the number of 
different edge sequences of length r between a specified pair of vertices 
(Theorem 7-8); Problems 7-20, 7-23, and 7-24; the number of different 
arborescences rooted at a given vertex in Chapter 9; and the number of 
different directed Euler lines in a digraph, also in Chapter 9, were all prob¬ 
lems of counting graphs. In this chapter a more unified approach to enumer¬ 
ating graphs will be taken. Certain enumerative techniques will be developed 
and used for counting certain types of graphs. A thorough exposition of 
Polya’s counting theorem, the most powerful tool in graph enumeration, is 
the central feature of this chapter. 

10-1. TYPES OF ENUMERATION 


All graph-enumeration problems fall into two categories: 

1. Counting the number of different graphs (or digraphs) of a particular 
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kind, for example, all connected, simple graphs with eight vertices and 
two circuits. 

2. Counting the number of subgraphs of a particular type in a given graph 
G, such as the number of edge-disjoint paths of length k between 
vertices a and b in G. 


The second type of problem usually involves a matrix representation of 
graph G and manipulations of this matrix. Such problems, although often 
encountered in practical applications, are not as varied and interesting as 
those in the first category. We shall not consider such problems in this 
chapter. 

In problems of type 1 the word “different” is of utmost importance and 
must be clearly understood. If the graphs are labeled (i.e., each vertex is 
assigned a name distinct from all others), all graphs are counted. On the other 
hand, in the case of unlabeled graphs the word “different” means non¬ 
isomorphic, and each set of isomorphic graphs is counted as one. 

As an example, let us consider the problem of constructing all simple 
graphs with n vertices and e edges. There are n(n — l)/2 unordered pairs of 
vertices. If we regard the vertices as distinguishable from one another (i.e., 
labeled graphs), there are 


f n(n — 1)\ 
2 

e 


( 10 - 1 ) 


ways of selecting e edges to form the graph. Thus expression (10-1) gives the 
number of simple labeled graphs with n vertices and e edges. 

Many of these graphs, however, are isomorphic (that is, they are the same 
except for the labels of their vertices). Hence the number of simple, unlabeled 
graphs of n vertices and e edges is much smaller than that given by (10-1). 

Among a collection of graphs, isomorphism is an equivalence relation 
(Problem 10-1). The number of different unlabeled graphs (of a certain type) 
equals the number of equivalence classes, under isomorphism, of the labeled 
graphs. For example, we have 16 different labeled trees of four vertices (Fig. 
3-15), and these trees fall into two equivalence classes, under isomorphism. In 
Fig. 3-15 the 4 trees in the top row fall into one equivalence class, and the 
remaining 12 into another. Thus we have only two different unlabeled trees of 
four vertices (Fig. 3-16). 

Let us now proceed with counting certain specific types of graphs. 


Theorem 10-1 

The number of simple, labeled graphs of n vertices is 


2n(n-l)/2 - 


(10-2) 
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Proof: The numbers of simple graphs of n vertices and 0, 1, 2,..., n(n — l)/2 
edges are obtained by substituting 0, 1, 2,... , n(n — l)/2 for e in expression (10-1). 
The sum of all such numbers is the number of all simple graphs with n vertices. 
Then the use of the following identity proves the theorem: 

(o) + (f) + (*)+■'• + (* - l) + (*) = 2 ‘- ■ 


10-2. COUNTING LABELED TREES 


Expression (10-1) can be used to obtain the number of simple labeled 
graphs of n vertices and n — 1 edges. Some of these are going to be trees and 
others will be unconnected graphs with circuits. Let us now prove Theorem 
3-10, which gives the number of trees. 

Theorem 3-10 

There are n n ~ 2 labeled trees with n vertices (« > 2). 

Proof of Theorem 3-10: Let the n vertices of a tree T be labeled 1,2,3,. .., n. 
Remove the pendant vertex (and the edge incident on it) having the smallest label, 
which is, say, a x . Suppose that b x was the vertex adjacent to a x . Among the remain¬ 
ing n — 1 vertices let a 2 be the pendant vertex with the smallest label, and b 2 be 
the vertex adjacent to a z . Remove the edge ( a 2 , b 2 ). This operation is repeated on 
the remaining n — 2 vertices, and then on n — 3 vertices, and so on. The process 
is terminated after n — 2 steps, when only two vertices are left. The tree T defines 
the sequence 

(b u b 2 ,...,b n _ 2 ) (10-3) 


uniquely. For example, for the tree in Fig. 10-1 the sequence is (1, 1, 3, 5, 5, 5, 9). 
Note that a vertex i appears in sequence (10-3) if and only if it is not pendant (see 
Problem 10-2). 

Conversely, given a sequence (10-3) of n — 2 labels, an w-vertex tree can be 



Fig. 10-1 Nine-vertex labeled tree, 
which yields sequence (1, 1, 3, 5, 5, 5, 9). 
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constructed uniquely, as follows: Determine the first number in the sequence 

(10-4) 

that does not appear in sequence (10-3). This number clearly is a x . And thus the 
edge (a x ,b x ) is defined. Remove b t from sequence (10-3) and a x from (10-4). In 
the remaining sequence of (10-4) find the first number that does not appear in the 
remainder of (10-3). This would be a 2 , and thus the edge ( a 2 , b 2 ) is defined. The 
construction is continued till the sequence (10-3) has no element left. Finally, 
the last two vertices remaining in (10-4) are joined. For example, given a sequence 

(4,4, 3,1,1), 

we can construct a seven-vertex tree as follows: (2, 4) is the first edge. The second 
is (5,4). Next, (4, 3). Then (3, 1), (6, 1), and finally (7, 1), as shown in Fig. 10-2. 

7 


Fig. 10-2 Tree constructed from se¬ 
quence (4, 4, 3, 1, 1). 

For each of the n — 2 elements in sequence (10-3) we can choose any one of 
n numbers, thus forming 

n n ~ 2 (10-5) 

(n — 2)-tuples, each defining a distinct labeled tree of n vertices. And since each tree 
defines one of these sequences uniquely, there is a one-to-one correspondence 
between the trees and the n n ~ 2 sequences. Hence the theorem. ■ 

Rooted Labeled Trees: In a rooted graph one vertex is marked as the root. 
For each of the n n ~ 2 labeled trees we have n rooted labeled trees, because any 
of the n vertices can be made a root. Therefore, 

Theorem 10-2 

The number of different rooted, labeled trees with n vertices is 

n n ~ 1 . (10-6) 

All rooted trees for n = 1,2, and 3 are given in Fig. 10-3. 

10-3. COUNTING UNLABFLED TREES 

The problem of enumeration of unlabeled trees is more involved and 
requires familiarity with the concepts of generating functions and partitions. 
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Fig. 10-3 Rooted labeled trees of one, two, and three vertices. 


Generating Functions 

One of the most useful tools in enumeration techniques is the generating 
function. A generating function f (x) is a power series 

f(x) — a 0 + a { x + a 2 x 2 + • • • (10-7) 

in some dummy variable jc. The coefficient a k of x k is the desired number, 
which depends on a collection of k objects being enumerated. For example, in 
the generating function 


(1 + xf = 



(10-8) 


the coefficient of x k gives the number of distinct combinations of n different 
objects taken k at a time. 

As another example, consider the following generating function: 


(1 — x) " = (1 4- x + x z + x 3 + • • •)” 

' n+k - V- 

V k ) 


s 

k = 0 


(10-9) 


The coefficient of x k in (10-9) gives the ways of selecting k objects from n 
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(distinct) objects with unlimited repetitions.! Note that the variable x has no 
significance. We are interested only in the coefficients. 

The generating function is used as a counting device and is therefore also 
called a counting series or an enumerator. An operation on a generating 
function is simpler than the corresponding operation on the sequence of 
coefficients a 0 , a x , a 2 , . ... For a detailed treatment of generating functions, 
the reader is referred to Chapter 2 in [3-11] or Chapter 3 in [10-1]. 

Partitions 

Another useful and important concept in enumerative combinatorics is 
that of a partition of a positive integer. When a positive integer p is expressed 
as a sum of positive integers 

P = - 1" A 3 4" * * ‘ + kg, 

such that A, > k 2 > A 3 > • • • > k g > 1, (10-10) 

the g-tuple is called a partition of integer p. For example, (5), (4 1), (3 2), 
(3 1 1), (2 2 1), (2 1 1 1), and (11111) are the seven different partitions of the 
integer 5. 

The integers, A/s, are called parts of the partitioned number p. It is con¬ 
venient to represent the repeated parts by means of exponents; for example, 
partition (2 111) is written as (2 l 3 ). 

The partitions of an integer p may be unrestricted or may have some 
restrictions on them, such as no repetition of any part [i.e., A,. 9 ^ A y in (10-10)], 
or no part greater than k is allowed. The number of partitions of a given 
integer p is often obtained with the help of some generating function. For 
example, the coefficient of x k in the polynomial 

(1 + *)(1 + * 2 )(1 + x 3 ). . . (1 + x>) (10-11) 

gives the number of partitions, without repetition, of an integer k < p (see 
page 111, [3-11]). 

Partitions are important to us because many graph-enumeration problems 
can be expressed in the form of partition problems. 

Rooted Unlabeled Trees 

Coming back to counting trees, let us recall that a rooted, unlabeled tree 
is one in which all vertices except the root are assumed alike. Let u n be the 

fThe result can be proved as follows: Let the n objects be labeled 1,2, 3and 
let a specific selection be a list of k integers a iy a 2 , .. •, a k arranged in nondecreasing 
order. The o,’s are not necessarily distinct. From this list we get a new list a\,a-i + 1, 
#3 + 2,..., ak + k — 1 by adding 0 to a\, 1 to « 2 , and so on. Each term in the new 
list is distinct. Thus every selection with unlimited repetitions can be identified uniquely 
as a selection of k distinct integers from integers 1,2, ..., n + k — 1 . 
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Fig. 10-4 Rooted tree decomposed into rooted subtrees. 


number of unlabeled, rooted trees of n vertices, and let u n (m) be the number of 
those rooted trees of n vertices in which the degree of the root is exactly m. 
Then 

n— 1 

K = £ W„0). 

m— 1 

Any rooted tree T of n vertices and with root R of degree m can be looked 
upon as composed of m rooted subtrees, each attached to R by means of an 
edge between its root and R. For example, in Fig. 10-4 an 11-vertex, rooted 
tree is composed of four rooted subtrees. 

In an n-vertex tree T the n — 1 vertices are distributed among the m 
subtrees, and thus T defines an ra-part partition of the number n — 1. Sup¬ 
pose that kj is the number of such subtrees (in T) with j vertices. Then 

k j -j- 2k 2 “t - 3Ar 3 -}- • • • T~ (w — Y)k„- j = n — 1 (10-12) 

and k i -f- T- k 3 -f- * ■ ■ T - k„_ j = m. (10-13) 

Note that Eqs. (10-12) and (10-13) represent an w-part partition of integer 
n — 1, in which integer i appears k t times (0 < k { < n — 1). 

In Fig. 10-4, for example, 

« — 11, m = 4, 

k j 1? ^2 ^5 ' 

k 3 = k 4 = k 6 = k n = • • • = k l0 = 0. 

Thus 2 kj = 4 

and YjJkj = 10. 

One can construct distinct rooted trees with j unlabeled vertices. Out of 
these trees we select kj trees to form subtrees of T. Since the same tree may 
appear more than once as a subtree of T, we have the problem of finding the 
number of ways of selecting k } objects out of w y objects with unlimited repeti- 
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tion. According to Eq. (10-9), this number is 


Uj + kj 



(10-14) 


Since each such selection can be made independently, the possible number of 
distinct trees for this specific partition is 


W„(^ 15^2’* • • ’ k„- 1) 


/«, + k x — 1 \/u 2 + k 2 — 1\ 

\ *, /l k 2 ) 


I u n -i + k n -i — 1 
\ k H . i 


5 


(10-15) 


where u n (k x , k 2 , , k„_ ,) stands for the number of ^-vertex, rooted trees 

corresponding to the partition 

2 ki 3 ki .. .{n — l)*”- 1 . 

Addition of u n (k x , k 2 ,. . . , k n ^ x ) over all possible partitions of n — 1 
yields the total number of spanning trees. That is, 




n - X 

s n 

partitions j= 1 
of n- 1 


(" + ,‘; " ') 


(10-16) 


What we have obtained in (10-16) is "a recurrence relation—a solution 
typical of many combinatorial problems. It gives u n , the number of rooted, 
unlabeled trees of n vertices, in terms of m,, u 2 , . . . , u n _ x . To use this relation, 
one builds up numerical tables in a step-by-step fashion. For example, 


u x — 1 


u 2 = 


u, 


<u x + i - r 

i 

<u i + i — i 

i i 


= i, 
+ 


m, + 2 - r 

2 


To evaluate w 4 , we first have to find all partitions of integer 3. These are 

(3), (2,1), and (1,1,1). 

The sum of the respective terms contributed by these partitions is 


w 4 = 


T 1 — 1 
, 1 


+ 


< u 2 + 1 — 1 \( u \ H - 1 — 1 \ /w, + 3 

> 1 1 + 3 


r 


Similarly, to evaluate u s we observe that the integer 4 has five different 
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Fig. 10-5 Rooted, unlabeled trees of one, two, three, and four 
vertices. 


partitions, and these are 

(4), (3,1), (2,2), (2,1,1), and (1, 1, 1, 1). 

The number of rooted trees corresponding to each of these five partitions is 
obtained using (10-15). The sum yields u s : 


_ j u i + 3 \ , ( u i + 1\ (u 2 + 1\ 

! ~( 4 j + ( 2 )“ 2 + ( 2 ) + " 3 + “ 4 

= 14-1 + 1+ 2 + 4 = 9. 


And so on. In Fig. 10-5, all rooted, unlabeled trees of one, two, three, and 
four vertices are shown. 

Clearly, computation of u n for, say, n = 20, using (10-16) is extremely 
tedious and involved. It requires obtaining all possible partitions of integer 
19 (there are 490 partitions of 19), computing w 19 , « 18 , . . . , w 2 , evaluation 
of the combinatorial product term 


n 



i 


for each partition, and then taking the sum of all 490 such terms. 

Counting Series for u n : To circumvent some of these difficulties in com¬ 
putation of let us find its counting series (i.e., the generating function) 
u(x), where 

u(x ) = u x x + u 2 x 2 + u 3 x 3 + • • • 

oo 

= £ ux (10-17) 

n-\ 

oo 

= r ^ u„x n ~ 1 . 

n — 1 


Substitution of (10-16) in (10-17) and substitution of n — 1 by its partition as 
in (10-12) yields 
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«(*) = E 


, u x + k x — 1\ / u 2 -\- k 2 1 


n= 1 partitions 
of n- 1 


ky 


I** 1 I 


.2 kt 


f Un-l + k n - [ — r 

^ k n -\ > 


(10-18) 


An- 


Observing that every sequence of positive integers forms a partition of 
some integer, (10-18) can be rearranged as 


u(x) = x 


, £ 

Lfc< = 0 


j -T k j — 1 

\ k t 


\x k ' S 

_ L/t 2 =o 


f u 2 + k 2 — r 


x 2 * 1 


oo 

E 1 

(u„- 1 + k n -y - 

\ 1 )k(n- 1) 

— kn-l — 0 

\ k n —! 

I J 


Substituting the identity 


(1 - x m )~ p = 2 

j = 0 


f p + i - i' 


,mj 


in (10-19) gives us the desired counting series. That is, 
u(x) — x(l — Ar) _ “‘(l — x 2 )"“ 2 (1 — x 3 )~ 

oo 

= x n o - ^ r ) _ur - 


r= 1 


(10-19) 


(10-20) 


Calculation of u n from (10-20) involves building up a table of u t for 
i = 1, 2, 3, . . ., n — 1, and substituting the values in (10-20). The first 10 
terms in the series (10-20) are 


u(x ) = x + x 1 + 2x 3 + 4x 4 + 9x 5 + 20x 6 + 48x 7 
+ 1 15x 8 + 286x 9 + 719x 10 + 


(10-20a) 


The reader should verify (10-20a) himself and extend the expansion 
through another 10 terms. 

The generating function w(x) can be expressed in an alternative form as 
follows: 

Taking the natural logarithms of both sides of Eq. (10-20), we get 


In u(x) = In x — Yj u r l n 0 ~~ * r ) 

r= 1 


= In X + X) u r £ — 

r = 1 1=1 l 


oo 1 

= lnx + i;4- 
M i 


oo 


E U r x '‘ 

r= 1 


= In x + £ 4-w(*9* 

i=i i 
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Therefore, 

u( x) = xe 2r=1 <1/i) “ u ' ) . (10-21) 

Form (10-21) is due to George Polya, whereas (10-20) is Arthur Cayley’s. 

To obtain the generating function for (free) unlabeled trees from rooted 
unlabeled trees, one can look at a (free) tree as composed of subtrees, rooted 
at some sort of central vertex distinct from all other vertices in the tree. For 
this, we shall use the concept of centroid in a tree. 

Centroid 

In a tree T, at any vertex v of degree d, there are d subtrees with only 
vertex v in common. The weight of each subtree at v is defined as the number of 
branches in the subtree. Then the weight of the vertex v is defined as the 
weight of the heaviest of the subtrees at v. A vertex with the smallest weight 
in the entire tree T is called a centroid of T. 

Just as in the case of centers of a tree (Section 3-4), it can be shown that 
every tree has either one centroid or two centroids. It can also be shown that 
if a tree has two centroids, the centroids are adjacent. In Fig. 10-6 a tree with 
a centroid (called a centroidal tree ) and a tree with two centroids (called a 
bicentroidal tree ) are shown. The centroids are shown enclosed in circles, and 
the numbers next to the vertices are the weights. 

Free Unlabeled Trees 

Let t'(x) be the counting series for centroidal trees, and t"(x) be the 
counting series for bicentroidal trees. Then t(x), the counting series for all 
(unlabeled, free) trees, is the sum of the two. That is, 

t(x) = t\x) + t"(x). (10-22) 

To obtain t"{x), observe that an ^-vertex bicentroidal tree can be regarded 
as consisting of two rooted trees each with n/2 — m vertices, and joined at 
their roots by an edge. (A bicentroidal tree will always have an even number of 
vertices; why?) Thus the number of bicentroidal trees with n = 2m vertices is 




(a) Centroidal Tree (b) Bicentroidal Tree 

Fig. 10-6 Centroid and bicentroids. 
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= = «-(«» + 0 
_ ^ U m ( U m ~t~ 0 jy2m 

in^ 1 2 

1 oo 1 oo 

= -y S w m* 2m + -y 2 («m^) 2 (10-23) 

Z m = 1 Z m =i 

= ^-w(x 2 ) + \ f] (w m X m ) 2 . 

Z Z m = I 

The number of vertices, n, in a centroidal tree can be odd or even. If n is 
odd, the maximum weight the centroid could have is \{n — 1). This maximum 
is achieved only when the tree consists of a path of n — 1 edges. On the other 
hand, if n is even and the tree is centroidal, the maximum weight the centroid 
could possibly have is n — 2). This maximum is achieved when the degree 
of the centroid is three, and one of the subtrees consists of just one edge. 

Thus, regardless whether n is odd or even, it is clear that an w-vertex (free) 
centroidal tree can be regarded as composed of several rooted trees, rooted at 
the centroid, and none of these rooted trees can have more than \_(n — 1)/2J 
edges, where |_.xj denotes the largest integer no greater than x. In view of 
this observation, an involved manipulation of Eq. (10-21) leads to the fol¬ 
lowing (for missing steps see [10-3]): 

/'(*) = w(x) — \u\x) — if] ( u m x m ) 2 . (10-24) 

^ £ m—\ 

Adding (10-23) and (10-24), we get the desired counting series: 

t(x) = u(x ) — 'u 2 (x ) — u(x 2 )j- (10-25) 

This relation, which gives the tree-counting series in terms of the rooted-tree 
counting series, was first obtained by Richard Otter in 1948 and is known 
as Otter’s formula. The first 10 terms of (10-25) are 

t(x) = x + x 2 + jc 3 + 2jc 4 + 3.x 5 + 6x 6 + 1 Xx 1 
+ 23jc 8 + 47.x 9 + 106x 10 + • • •. 

The reader is encouraged to extend it by another 10 terms. The first 26 terms 
of both u(x) and t(x) are given in Riordan’s book [3-11], page 138. 

By now you must have the impression that enumeration of graphs is an 
involved subject. And indeed it is. So far we have enumerated only four types 
of graphs—rooted and free trees, both labeled and unlabeled varieties. It is 


given by 


and therefore 
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difficult to proceed further without some additional enumerative tool. This is 
provided by a general counting theorem due to Polya. We shall first state 
and discuss Polya’s theorem and then show how it can be applied for count¬ 
ing graphs. 


10-4. POLYA'S counting theorem 

To understand Polya’s theorem, we need a few additional concepts in 
combinatorial theory. In this section we shall first define a permutation and 
see how it can be represented in different ways. Then we shall show how a 
set of permutations P can form a group (called a permutation group) under a 
binary operation called composition. Then we shall introduce a polynomial 
called the cycle index of a permutation group P. Finally, we shall show that 
all mappings/.’s from a domain D to a range R (both D and R being finite) 
are divided into equivalence classes by any permutation group P acting on the 
domain D. 

After introducing these concepts we shall define figure-counting series and 
configuration-counting series. And this will be followed by the statement of 
the celebrated theorem of Polya, which expresses the configuration-counting 
series in terms of the figure-counting series and the cycle index of the permu¬ 
tation group. The statement of the theorem will be followed by discussion and 
two illustrative examples. 

If Polya’s theorem and the buildup to it do not appear very intuitive to 
you, don’t worry; you are not alone. What is important is to understand the 
theorem and be able to use it for counting different types of graphs. 

Permutation 

On a finite set A of some objects, a permutation n is a one-to-one mapping 
from A onto itself. For example, consider a set [a, b, c, d}. A permutation 

(ab c d\ 

71 > ~ \bdca) 

takes a into b, b into d , c into c, and d into a. Alternatively, we could write 

n^a) = b, 
n x (b) = d, 

7r,(c) = c, 
n x {d) — a. 

The number of elements in the object set on which a permutation acts is 
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called the degree of the permutation. The degree of n x in the above example is 
four. 

A permutation can also be represented by a digraph, in which each vertex 
represents an element of the object set and the directed edges represent the 

mapping. For example, the permutation n x (abed'' 

grammatically by Fig. 10-7. 


(ft dc a) * s re P resente d dia- 


b 




Fig. 10-7 Digraph of a permutation. 


Observe that the in-degree and the out-degree of every vertex in the 
digraph of a permutation is one. Such a digraph must decompose into one or 
more vertex-disjoint directed circuits (why?). This suggests yet another way of 
representing a permutation—as a collection of the vertex-disjoint, directed 

circuits (called the cycles of the permutation). Permutation (^ ^ ^ can thus 

be written as ( a ft d)(c). This compact and popular representation is called the 
cyclic representation of a permutation. The number of edges in a permuta¬ 
tion cycle is called the length of the cycle in the permutation. 

Often the only information of interest about a permutation is the number 
of cycles of various lengths. A permutation n of degree k is said to be of type 
(cr,, cr 2 ,. . . , a k ) if n has a t cycles of length /for / = 1,2 ,... ,k. For example, 
permutation {a ft d)(c) is of type (1,0,1,0) and permutation (a ft f)(c)(d e h)(g) 
is of type (2, 0, 2, 0, 0, 0, 0, 0). Clearly, 


lerj + 2(j 2 + 3cr 3 T • • • -f- kc t k — k. (10-26) 

Another useful method for indicating the type of a permutation is to 
introduce k dummy variables, say, y x ,y 2 , ■ • ■ ,y, fe » and then show the type 
of permutation by the expression 

yVyV ■■■yV- (10-27) 

Expression (10-27) is called the cycle structure of 7i. For example, the cycle 
structure of the eight-degree permutation (a ft f){c){d e h){g) is 

y\ y 2 yi yl y°s yl y? y\I = y\ yb 

Note that the dummy variable y. has no significance except as a symbol to 
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which subscripts (indicating the lengths) and exponents (indicating the num¬ 
ber of cycles) are attached. Two distinct permutations (acting on the same 
object set) may have the same cycle structure (page 149 in [10-1]). 

So far we have discussed only the representation and properties of a 
permutation individually. Let us now examine a set of permutations col¬ 
lectively. 

On a set A with k objects, we have a total of k ! possible permutations— 
including the identity permutation, which takes every element into itself. For 
example, the following are the six permutations on a set of three elements 
[a, b, c}: 

(a)(b)(c\ (a b)(c), (a c)(b), {a)(b c ), {a b c), (a c b). 

Their cycle structures, respectively, are 

y\ > y iT 2’ yiy y 3, y 3 - ( 10-28) 

Composition of Permutations 


Consider the two permutations n l and n 2 on an object set [1, 2, 3, 4, 5}: 


_ (l 2 3 45\ 
n ' ~ \2 1 4 5 3/ 


and 



A composition of these two permutations n 2 n x is another permutation 
obtained by first applying n x and then applying n 2 on the resultant. That is. 


Thus 


n 2 n x {\) = n 2 (2) = 4, 
n 2 J r,(2) = 7r 2 ( 1) = 3, 
7t 2 n x {7>) = 7r 2 (4) = 2, 
n 2 Ki( 4 ) = tt 2 (5) = 5, 
7r 2 7r,(5) = 7i 2 (3) = 1. 



Thus among a collection of permutations on the same object set, composition 
is a binary operation. 


Permutation Group 

A collection of m permutations P = {n x ,n 2 , , n m ] acting on a set 


A = [a„ a 2 ,..., a k ] 
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forms a group under composition, if the four postulates! of a group, that is, 
closure, associativity, identity, and inverse (see Section 6-1), are satisfied. 
Such a group is called a permutation group. For example, it can be easily 
verified that the set of four permutations 

{(a)(b)(c)(d), (a c)(b d), {abed), (a d c b)} (10-29) 

acting on the object set f a, b, c, d] forms a permutation group. 

The number of permutations m in a permutation group is called its order, 
and the number of elements in the object set on which the permutations are 
acting is called the degree of the permutation group. In the example just cited, 
both the degree and order of the permutation group is four. It can be shown 
that the set of all k\ permutations on a set A of k elements forms a permuta¬ 
tion group. Such a group, of order A:! and degree k, is called the full symmetric 
group, S k . 

Cycle Index of a Permutation Group 

For a permutation group P, of order m, if we add the cycle structures of 
all m permutations in P and divide the sum by m, we get an expression called 
the cycle index Z{P) of P. For example, the cycle index of S 3 , the full sym¬ 
metric group of degree three, according to (10-28) comes out to be 

Z(S 3 ) = + 3 y,y 2 + 2y,). (10-30) 

Similarly, the cycle index of the permutation group (of degree four and order 
four) shown in (10-29) is 

+ y\ + 2y 4 ). 

Since the cycle index is the most important concept in this section, let us 
illustrate it with another example. Let us find Z(S 4 ). 

Table 10-1 gives the different types of permutations possible in S 4 , the full 
symmetric group of degree four. 

Table 10-1 is easy to understand and to construct. For example, we have 
six permutations of type (2, 1, 0, 0) on the object set [a, b, c, d}: 

(a) (b)(c d), (a)(c)(b d), ( a)(d)(b c), 

(b) (c)(a d), (b)(d)(a c ), (c)(d)(a b). 

fin fact, it can be shown that if a collection of permutations is closed with respect to 
composition, the remaining three postulates are automatically satisfied (Problem 10-4). 
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Permutation Type 

Number of Such 
Permutations 

Cycle Structures 

(4,0,0, 0) 

1 

A 

(2,1,0, 0) 

6 

y\yi 

(1,0,1,0) 

8 

y\y* 

(0, 2, 0,0) 

3 

y\ 

(0,0, 0,1) 

6 

y 4 


Table 10-1 


To get the cycle index of S 4 from Table 10-1, we multiply the corresponding 
entries in the second and third columns, add the products, and then divide 
the sum by 4!, the order of the group. Thus 

Z (S 4 ) = 2^0! + 6 y\y 2 + 8 \y i y 3 + 3 y\ + 6 y 4 ). (10-31) 

To display the variables involved, the cycle index of a permutation group 
P is often written as 

Z(P) = Z(P;y t ,y 2 , . . . , y k ). 

It is evident that computation of the cycle index of an arbitrary permuta¬ 
tion group can become quite involved and laborious. There are certain 
groups, such as S k , whose cycle indices have been derived in closed forms. 
These are related to the partitions of integer k satisfying Eq. (10-26). For 
more on methods of obtaining cycle indices, the reader should see [10-1] and 
[1-5]. 


Cycle Index of the Pair Group 


When the n vertices of a graph G are subjected to permutation, the 
n(n — l)/2 unordered vertex pairs also get permuted. For example, let 
V = {a, b, c, d ] be the set of vertices of a four-vertex graph. The permutation 



a b c d\ 
d b a c) 


on the vertices induces the following permutation on the six unordered vertex 


pairs: 


/ ab ac ad be bd cd\ 
\db da dc ba be ac) 


The diagrams of permutation ft and the induced permutation are shown in 
Fig. 10-8. 

Notice that a y t y 3 permutation on the vertex set induces a y\ permutation 
on the vertex-pair set. 
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Fig. 10-8 Permutation on vertex set and the induced permutation 
on vertex-pair set. 


Similarly, each of the n\ possible permutations on the n vertices of a 
graph results in some permutation of the n(n — l)/ 2 , unordered, vertex pairs 
(or n(n — 1) ordered vertex pairs, in the case of digraphs). Furthermore, it 
can be shown that if a set of permutations on the vertices forms a group, the 
induced set of permutations on the pair of vertices will also form a group 
(Problem 10-5). For instance, the full symmetric group S„ on n vertices of a 
graph induces a group R n of n\ permutations on the pairs of vertices.! Such 
an induced group is called the pair group R n . Let us work out the pair group 
R 4 induced by S 4 , the full symmetric group on the vertices of a four-vertex 
graph. 

The identity permutation on the four vertices of a graph produces an 
identity permutation on the six pairs of vertices. A permutation with two 
cycles of length one and one cycle of length two produces two cycles of length 
one and two cycles of length two. And so on. The cycle structures of per¬ 
mutations in tS '4 and the corresponding cycle structures of the induced 
permutations on the pairs of vertices are shown in Table 10-2. 


Term in Z(S 4 ) 

Induced Term in Z(R^) 

No. of Permutations 

A 

y\ 

1 

y\yi 

y\y\ 

6 

y\y* 

y\ 

8 

y\ 

y\y\ 

3 

T4 

yiy* 

6 


Table 10-2 


Therefore, the cycle index of the pair group R 4 (induced on the pairs of 
vertices by S' 4 ) is 

Z(R 4 ) = j^{y\ + 9 y\y\ + 8 y\ -|- 6y 2 yJ. (10-32) 

For a general expression for Z(R n ) see [10-2]. 

•{•Except for n = 2, in which case the number of possible permutations on the pair is 1, 
rather than 2 . 
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Equivalence Classes of Functions 

As a further preliminary to describing Polya’s theorem, let us introduce 
some additional concepts. Consider two sets D and R, with the number of 
elements |Z)| and \R\, respectively. Let/be a mapping (or function) which 
maps each element d from domain D to a unique image f(d ) in range R. Since 
each of the | D \ elements can be mapped into any of the \R \ elements, the 
number of different functions from D to R is |/?| 101 . 

Now let there be a permutation group P on the elements of set D. Then 
define two mappings f x and/ 2 as P-equivalent if there is some permutation n in 
P such that for every d in D we have 

(10-33) 

That the relationship defined by (10-33) is an equivalence relation can be 
shown as follows: 

1. Since P is a permutation group, it contains the identity permutation, 
and thus (10-33) is reflexive. 

2. If P contains permutation n , it also contains the inverse permutation 
n~ x . Therefore, the relation is symmetric also. 

3. Furthermore, if P contains permutations n x and n 2 , it must also contain 
the permutation n x n z . This makes P-equivalence a transitive relation. 

Since an equivalence relation divides a set into equivalence classes, all 
mappings from D to R are divided into equivalence classes by a permutation 
group P acting on set D. As an example, let D = ( a , b, c} and R — { 5 , t}. 
There are 2 3 = 8 mappings/i,/ 2 , • - • ,/g from D to R, as shown in Table 
10-3. 



/. 

fl 

h 

u 

fs 

h 

fl 

fs 

m 

5 

s 

s 

t 

s 

t 

t 

t 

m 

s 

s 

t 

s 

t 

s 

t 

t 

m 

s 

t 

s 

s 

t 

t 

s 

t 


Table 10-3 


Now suppose a permutation group P = {(<z)(Z>)(c), (a b c), (a c b )} is acting on 
D. The reader can verify that the eight mappings in Table 10-3 will be divided 
into four equivalence classes. They are 


[/,}, [fiJ{fsJ'M [/si- 
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Polya's Counting Theorem 

Let us consider two finite sets, domain D and range R, together with a 
permutation group P on D. To each element p e R let us assign a quantity 
w[p ] and call it the content (or weight ) of the element p. The weight w[p] can 
be a symbol or a real number. A mapping/from D to R can be described by 
a sequence of | D \ elements of set R such that the /th element in the sequence 
is the image of the /th element of set D under/. Therefore the content W(f) of 
a mapping/can be defined as the product of the contents of all its images. 
That is, 

wn = n *i/m 

deD 

Clearly, all functions belonging to the same equivalence class defined by 
(10-33) have identical weights. Therefore, we define the weight of an entire 
equivalence class (of functions from domain D to range R) to be the (com¬ 
mon) weight of the functions in this class. Our problem is to count the num¬ 
ber of equivalence classes with various weights, given D, R, permutation 
group P on D, and weights w[p] for each p e R. This is exactly what Polya’s 
counting theorem gives. 

In Polya’s terminology, elements p of set R are called figures, and func¬ 
tions / from D to R are called configurations. Often the weights of the ele¬ 
ments of R can be expressed as powers of some common quantity x. In that 
case the weight assignment to elements of set R can be neatly described by 
means of a counting series A(x) 

A(x) = f) a^, (10-34) 

9 = 0 

where a q is the number of elements in set R with weight x q A Likewise, the 
number of configurations can be expressed in terms of another series, called 
configuration counting series B{x), such that 

B(x) = ± b m x m , (10-35) 

m= 0 

where b m is the number of different configurations having weight x m . Now we 
can state the following powerful result known as Polya’s counting theorem. 

Theorem 10-3 

The configuration-counting series B(x) is obtained by substituting the figure¬ 
counting series A(x‘) for each y t in the cycle index Z{P\y if y 2 ,. . . , y k ) of the 

f If the content assigned to figures cannot be expressed as powers of a single quantity x, 
then the figure-counting series will be a multinomial in different variables, rather than in 
just one variable x. 
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permutation group P. That is, 

B(x) = Z(P; 2 a q x Q , 2 a q x u , 2 <*q xig , • • •, 2 o g x k<3 ). (10-36) 

The proof of Polya’s theorem, although not complicated, is not particularly 
illuminating and is therefore left out. The reader can find it in [10-1], page 157. 
Our interest is mainly in the application of the theorem; let us illustrate it with 
some examples. 

Example 1: Suppose that we are given a cube and four (identical) balls. In 
how many ways can the balls be arranged on the corners of the cube? Two 
arrangements are considered the same if by any rotation of the cube they can 
be transformed into each other. 

The answer is seven, as can be seen by inspection in Fig. 10-9. In Polya’s 
terms the domain D is the set of the eight corners of the cube, and the range 









Fig. 10-9 Attaching four balls to corners of a cube. 


R consists of two elements (i.e., figures), “presence of a ball” or “absence of a 
ball,” with contents x' and x°, respectively. The figure-counting series is 

A{x) = 2 ] o q x q = a 0 x° 4- a x x x = 1 + x, (10-37) 

ff "0 

since a 0 , the number of figures with content 0, is one, and a y , the number of 
figures with content 1, is also one. The configurations are 2 8 = 256 different 
mappings that assign balls to the corners of the cube. The permutation 
group P on D is the set of all those permutations that can be produced by 
rotations of the cube. These permutations with their cycle structures are 

1. One identity permutation. Its cycle structure is >>?. 

2. Three 180° rotations around lines connecting the centers of opposite 
faces. Its cycle structure is y\. 
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3. Six 90° rotations (clockwise and counterclockwise) around lines con¬ 
necting the centers of opposite faces. The cycle structure is y\. 

4. Six 180° rotations around lines connecting the midpoints of opposite 
edges. The corresponding cycle structure is y 4 - 

5. Eight 120° rotations around lines connecting opposite corners in the 
cube. The cycle structure of the corresponding permutation is y\y\- 

The cycle index of this group consisting of these 24 permutations is, 
therefore, 


Z(P) = ±(y>, + 9vi + 6yi + 8 \y\y\). (10-38) 

Using Polya’s theorem, we now substitute the figure-counting series, that is 
1 + x for y lt 1 + x 2 for y 2 , 1 + x 3 for y 3 , and 1 + x 4 for y 4 . This yields the 
configuration-counting series. 

B(x) = 1 + x + 3x 2 + 3x 3 + 7x 4 + 3x 5 + 3x 6 + x 7 -f x 8 . (10-39) 

The coefficient of x 4 in B(x ) gives the number of P-inequivalent configurations 
of content x 4 (i.e., with four balls). This verifies the answer obtained by 
exhaustive inspection in Fig. 10-9. 

The total number of P-inequivalent configurations (with contents 
x°, x 1 , x 2 , . . . , x 8 ) is obtained by adding all coefficients in (10-39), which is 
23. It may be observed that this is the number of distinct ways of painting 
the eight vertices of a cube with two colors (one color corresponds to the 
“presence of a ball” and the other with the “absence of a ball”). 

Example 2: In example 1 we were given four identical balls. Now suppose 
that we are given two red balls and two blue balls, and are again asked to find 
the number of distinct arrangements on the corners of the cube. Clearly, D , 
P, and Z(P) will remain the same as they were in example 1. Only the range R 
and the figure-counting series A{x) will change. The range will contain three 
elements: (1) presence of no ball, (2) presence of a red ball, and (3) presence 
of a blue ball. Choosing x to indicate the presence of a red ball and x' to 
indicate the presence of a blue ball, the three elements in the range mentioned 
above will have the contents x°x'°, x^' 0 , and x°x' 1 , respectively. Therefore 
the figure-counting series is 

A(x, x') — x°x' 0 + x'x' 0 + x°x M = 1 + x + x\ 

Substituting this figure-counting series in (10-38), we get the configuration- 
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counting series 

B(x, x') = ^[(1 +x + x'f + 9(1 + x 2 + x' 2 ) 4 +6(1 +x 4 + x‘ 4 ) 2 

+ 8(1 + x + x') 2 (l +x 2 + x' 2 ) 2 ] 

= 1 + x + x' + 3x 2 + 3x' 2 + 3xx' + 3x 3 + 3x' 3 
+ 7x 2 x' + 7xx' 2 + 7x 4 + 7x' 4 + 13x 3 x' + 13xx' 3 
+ 22x 2 x' 2 + 3x 5 + 3x' s + \3x 4 x + 13xx' 4 

(10-40) 

+ 24x 3 x' 2 + 24x 2 x' 3 + 3x 6 + 3x' 6 + 7x 5 x ' 

+ 7xx' 5 + 22x 4 x' 2 + 22x 2 x' 4 + 24x 3 x' 3 + jc 7 + x' 7 
+ 3x 6 x' + 3xx' 6 + 7x s x' 2 + 7x 2 x' 5 + 13jc 3 x' 4 + 13jc 4 x' 3 
+ x 8 + x '* + x 1 x' + xx' 1 + 3x 6 x' 2 + 3x 2 x' 6 + 3x 5 x' 3 
+ 3x 3 x' 5 + 7x 4 x' 4 . 

The coefficient of x r x' b in (10-40) is the number of distinct arrangements with 
r red balls, b blue balls and 8 — r — b corners with no balls. The number of 
arrangements with two red and two blue balls is, therefore, 22. 

For some other non-graph-theoretic examples of the applications of 
Polya’s theorem, the reader should work out Problems 10-10, 10-11, 10-14, 
and 10-15. Let us now return to the counting of graphs. 

10-5. GRAPH ENUMERATION WITH POLYA'S 
THEOREM 

Enumeration of Simple Graphs: Let us consider the problem of counting 
all unlabeled, simple graphs of n vertices. Any such graph G can be regarded 
as a mapping (i.e., configuration) of the set D of all ^ n(n — 1) unordered 
pairs of vertices (for digraphs n(n — 1) pairs of vertices). Range R consists of 
two elements 5 and t, with contents x 1 and x°, respectively. If a vertex pair 
is joined by an edge in G, the vertex pair maps into s , an element with content 
x 1 ; otherwise, into t, an element with content x° = 1. Thus the figure-count¬ 
ing series is 

A(x) — XI a qX 9 = 1 + x. 

The relevant permutation group in this case is R n , the group of permuta¬ 
tions on the pairs of vertices induced by S„ (the full symmetric group on the n 
vertices of the graph).t Therefore, the configuration-counting series is 

fBecause in an unlabeled graph, all n vertices are indistinguishable. Were we to count 
labeled graphs the permutation group would have consisted of only the identity permuta¬ 
tion. Substitution of 1 + x in its cycle index would have yielded the simple result of expres¬ 
sion (10-1). 
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obtained by substituting 1 d- x for y n 1 + x 2 for y 2 , 1 + x 3 for y 3 , and so on 
in Z(RJ. Some specific cases are 

(1) For n = 3, 

Z(R,) = + 3 y,y, + 2y,). 

Therefore, the configuration-counting series is 

B(x) = ■![(! + xY + 3(1 H- xXl + x 2 ) + 2(1 + x 1 )] 

— Id x d- x 2 d - x 3 . 

The coefficient of x‘ in B(x ) is the number of configurations with content x' . 
The content of a configuration here is the number of edges in the correspond¬ 
ing graph. Thus the number of nonisomorphic simple graphs of three vertices 
with 0, 1,2, and 3 edges is each one. This is how it should be, as shown in Fig. 
10-10. 

(2) For n = 4, the cycle index Z(/? 4 ) is given in (10-32). Substituting 
1 d- x'' for y ; in (10-32), we get 

B(x) = Jj[(l + A") 6 + 9(1 + x) 2 ( 1 + x 2 ) 2 + 8(1 + X 1 ) 2 

+ 6(1 + x 2 Xl + x 4 )] 

= 1 |- x d- 2x 2 d 3x 3 d- 2x 4 + x 5 + x 6 . 


In (10-41) the coefficient of x r gives the number of simple graphs with four 
vertices and r edges. The validity of series (10-41) is verified in Fig. 10-11. 







• • 




Fig. 10-10 Simple unlabeled graphs of 
three vertices. 









Fig. 10-11 Simple unlabeled graphs of four vertices. 
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(3) For n — 5, the cycle index Z(R S ) is given in Problem 10-9. Substituting 
1 + x‘ for y. in Z(/? 5 ), we get the counting series B(x ) for simple graphs of 
five vertices, as follows: 


B(x) 


■- j^[(i + xy° + io(i + x)\i + x 2 y + 20(1 + *xi + * 3 ) 3 

+ i5(i + x)\i + x 2 y + 30(i + * 2 )(i + x*y 
+ 20(1 + JtXl + * 3 X1 + X 6 ) + 24(1 + x 5 ) 2 ] (10 ' 42) 

: 1 + x + 2x 2 + 4x 3 + 6x 4 + 6x 5 + 6x 6 + 4x 7 + 2x 8 
+ x 9 + x 10 . 


Again, for each r the coefficient of x r in (10-42) gives the number of simple 
graphs of five vertices and r edges. 

The number of simple, unlabeled graphs with n vertices for any n can be 
counted similarly. 

Enumeration of Multigraphs: Suppose that we are interested in counting 
multigraphs of n vertices, in which at most two edges are allowed between a 
pair of vertices. 

In this case the domain and the permutation group are the same as they 
were for simple graphs. The range, however, is different. A pair of vertices 
may be joined by (1) no edge, (2) one edge, or (3) two edges. Thus range R 
contains three elements, say, s, t, u, with contents x°, x 1 , and x 2 , respectively; 
that is, x' indicates the presence of i edges between a vertex pair, for i = 
0, 1,2. Threfore, the figure-counting series becomes 


1 + x + x 2 . 


(10-43) 


Substitution of 1 + x r + x 2r for y r in Z(R n ) will yield the desired configura¬ 
tion-counting series. For n — 4, using the cycle index from (10-32), we get 

^[(1 + x + x 2 ) 6 + 9(1 + x + x 2 ) 2 (l + x 2 + x 4 ) 2 + 8(1 + x 3 + x 6 ) 2 


+ 6(1 + x 2 + x 4 Xl + x 4 + x 8 )] 

= 1 + x + 3x 2 + 5x 3 + 8x 4 + 9x 5 + 12x 6 + 9x 7 + 8x 8 
+ 5x 9 + 3x 10 + x 11 + x 12 . 


(10-44) 


The coefficient of x‘ in (10-44) is the number of distinct, unlabeled, 
multigraphs of four vertices and i edges (such that there are at most two 
parallel edges between any vertex pair). For example, the coefficient of x 3 is 
5, and these five multigraphs are shown in Fig. 10-12. 

Instead of allowing at most two parallel edges between a pair of vertices, 
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Fig. 10-12 Unlabeled multigraphs of four vertices, three edges, and 
at most two parallel edges. 


had we allowed any number of parallel edges the figure-counting series 
would be the infinite series 

A(x ) = 1 + x + x 2 + x 3 + • • • = -j— - (10-45) 

Enumeration of Digraphs: For enumerating digraphs we have to consider 
all n(n — 1) ordered pairs of vertices as constituting the domain. The relevant 
permutation group will consist of permutations induced on all ordered pairs 
of vertices by S n . The cycle index of this permutation group, M n , can be 
obtained in the same fashion as was done in the case of R n . For example, for 
n = 4, Table 10-4 gives the terms in Z(M „) induced by each term in Z(S n ). 


Term in Z(S 4 ) 

Induced Term in Z(M 4 ) 

A 

y\2 

y\yi 

y\y\ 


y\ 

y\ 

y\ 

y\ 

y\ 


Table 10-4 


Therefore, the cycle index is 

Z(M t ) = JjW 2 + 6y\y\ + 8yS + 3 y! + 6y\). (10-46) 

For a simple digraph the figure-counting series A(x) = 1 -f- jt is applicable, 
because a given ordered pair of vertices (a, b ) either does or does not have an 
edge (directed) from a to b. On substituting 1 + x* for every in (10-46), we 
get the following configuration-counting series for four-vertex, simple 
digraphs. 

B(x) = ^[(1 + x) 12 + 6(1 + jc) 2 (1 + x 2 ) 5 + 8(1 + x*Y 

+ 3(1 + x 2 ) 6 + 6(1 + jt 4 ) 3 ] 

= 1 + x + 5x 2 + I3x 3 + 27x 4 + 38x 5 + 48* 6 
+ 38x 7 + 27 x 8 + 13jc 9 + 5x 10 + x 11 + x 12 . 


(10-47) 
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Fig. 10-13 Simple unlabeled digraphs of four vertices and two edges. 


The coefficient of x J in (10-47) is the number of simple digraphs with four 
vertices and j edges. For example, the five digraphs of two edges are shown in 
Fig. 10-13. ' 

The general expression for the cycle index, Z(M„), of the permutation 
group on n(n — 1) ordered pairs induced by S„ is given in [1-5], page 180. 
Digraphs with parallel edges can be enumerated by substituting the appro¬ 
priate figure-counting series, say (10-43), in Z(M„). 


SUMMARY 

Enumeration of graphs is one of the most involved areas in graph theory 
and deserves an entire volume to itself. In this chapter, we have briefly 
presented some enumerative techniques—the most important of them being 
Polya’s counting theorem. The major problem in using Polya’s theorem is 
finding the appropriate permutation group and then obtaining its cycle index. 

One could think of a hundred different types of graphs to be counted— 
each presenting a special problem. We have, in this chapter, counted the 
following five types of unlabeled graphs (enumeration of labeled graphs is 
much easier): (1) rooted trees, (2) free trees, (3) simple graphs, (4) multigraphs, 
and (5) simple digraphs. Important as these types of graphs are, they were 
enumerated mainly as illustrations. One could, for example, be interested in 
counting all unlabeled, simple graphs with n vertices that are (1) connected, 
or (2) planar, or (3) nonseparable, or (4) self-dual, and so on. 

Many such types of graphs have been enumerated and reported as 
research papers in the literature, but there are many types of graphs that have 
yet to be counted. 


REFERENCES 

Chapters 1,3,4, 5, and 6 of [10-1] are strongly recommended to supplement the 
material presented in this chapter. 

For an exhaustive survey of the literature in graph enumeration, see the paper by 
Harary [10-4], as well as Chapter 15 of [1-5], which contains a list of 66 solved problems 
with the appropriate references where the solutions are to be found. The latest list of 27 
unsolved problems in graph enumeration is discussed in another article by Harary [10-51. 
For a lucid exposition of Polya’s counting theorem, see the paper by deBruijn, which 
appears as Chapter 5 in [10-1], or see Chapter 5 of the book by Liu [8-3]. For some 
excellent illustrations of applications of Polya’s theorem to graph enumeration, see [10-2], 
[10-6], [10-7], [10-8], and Chapter 6 of [3-11]. 
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Many counting problems in chemistry, physics, biology, information theory, and so 
on, can be regarded as graph-enumeration problems. A survey of such applications is 
given in Chapter 6 of [10-1] and in [10-4]. A detailed treatment of an application to a 
problem in statistical mechanics is given in [10-9]. An application to counting of distinct 
automata is given in [12-5]. 

The pioneering papers of Cayley, Redfield, and Polya are not included in the following 
list. They have been referred to in most of the following: 

10-1. Beckenbach, E. F. (ed.). Applied Combinatorial Mathematics, John Wiley & Sons, 
Inc., New York, 1964. 

10-2. Harary, F., “The Number of Linear, Directed, Rooted and Connected Graphs,” 
Trans. Am. Math. Soc., Vol. 78, 1955, 445-463. 

10-3. Harary, F., “Note on the Polya and Otter Formulas for Enumerating Trees,” 
Michigan Math Journal, Vol. 3, 1956, 109-112. 

10-4. Harary, F., “Graphical Enumeration Problems,” in Graph Theory and Theoretical 
Physics (F. Harary, ed.), Academic Press, Inc., New York, 1967, 1-41. 

10-5. Harary, F., “Enumeration Under Group Action: Unsolved Problems in Graphical 
Enumeration IV,” J. Combinatorial Theory, Vol. 8, 1970, 1-11. 

10-6. Palmer, E. M., “Methods for the Enumeration of Multigraphs,” in The Many 
Facets of Graph Theory (G. Chartrand and S. F. Kapoor, eds.), Springer-Verlag 
New York, Inc., New York, 1969, 251-261. 

10-7. Read, R. C., “On the Number of Self-Complementary Graphs and Digraphs,” 
J. London Math. Soc., Vol. 38, 1963, 99-104. 

10-8. Robinson, R. W., “Enumeration of Non-separable Graphs,” J. Combinatorial 
Theory, Vol. 9, No. 4, Dec. 1970, 327-356. 

10-9. Uhlenbeck, G. E., and G. W. Ford, “Theory of Linear Graphs with Applications 
to the Theory of the Virial Development of the Properties of Gases,” in Studies in 
Statistical Mechanics, Vol. 1 (J. de Boer and G. E. Uhlenbeck, eds.), North- 
Holland Publishing Company, Amsterdam, 1962, 123-211. 


PROBLEMS 

10-1. Satisfy yourself that for a set of graphs isomorphism (as defined in Section 2-1) 
is indeed an equivalence relation. That is, the relation is reflexive, symmetric, and 
transitive. 

10-2. Prove that a vertex v appears in sequence (10-3) m times if and only if degree of 
v — m — 1. 

10-3. Prove that a digraph in which the in-degree as well as the out-degree of every 
vertex is one can be decomposed into one or more vertex-disjoint directed circuits. 
{Hint: In such a digraph every component is a directed circuit.) 

10-4. Prove that a subset A of a finite group forms a subgroup if the subset satisfies 
the closure postulate. {Hint: Show the existence of the inverse of an element 
a e A as follows: Elements a, a 2 , a 3 ,. .. cannot all be distinct because the group 
is finite, but they must all be in A because of the closure property. Suppose that 
aP = ap, where p > q. Therefore, aP~v = 1 ora -1 = ap-^K) 

10-5. Prove that if a set of permutations P on an object set S forms a group, the set R 
of all permutations induced by P on set S x 5 also forms a group. [Hint: Prove 
closure by showing that the composition of two permutations on S x S induced 
by any two permutations n\, n 2 (in P) is the permutation induced by the com¬ 
position {Hi-iii). Then use Problem 10-4.] 
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10-6. Show that the cycle index of a group consisting of the identity permutation only 
is y \|, k being the number of elements in the object set. 

10-7. Show that the cycle index of the induced pair group R 3 is the same as that of £3. 
That is, 

Z(R 3 ) = -g-Cy? + 3yiy 2 + 2 y 3 ). 

10-8. Show that the cycle index of S 5 , the full symmetric group of degree five, is 
Z(S s ) = + I0yfy 2 + 20 y\y 3 + \5y x y\ 

+ 30yi/4 + 20>’2y3 + 24y$). (10-48) 

10-9. Show that the cycle index of the unordered pair group Rs (on the set of 10 
unordered pairs induced by £5) is 

Z(Rs) = -^(yl 0 + 10 y*y} + 20yiy\ + 15 y\y\ 

+ 30 y 2 y} + 20yi>’3>'6 + 24 y\). 

{Hint: Use the result of Problem 10-8.) 

10-10. Find the different ways of painting the six vertices of an octahedron with three 
colors. Two octahedrons are colored distinctly if they cannot be made to coincide 
by any rotation. [Hint: First show that the cycle index of the permutation group is 

+ 6yfy 4 + 1y\y\ + 6 y\ + 8 y\). 

Then substitute the figure-counting series 1 -f x + x'.} 

10-11. List all partitions of 5, and use them to find w 6 , the number of unlabeled trees of 
six vertices. (You may use the values of wi, u 2 , ■ .., u 5 given in this chapter.) 

10-12. Given a square, show that there are exactly eight distinct motions (combinations 
of rotations and reflections) which bring the square into coincidence with itself. 
Show that these motions form a group (called dihedral group D 4 ). Furthermore, 
show that the cycle index of this group is 

Z{D 4 ) = -g-(.yf + 2 yjy 2 + 3.v^ + 2y 4 ). 

10-13. Show that the order of D„, the group of symmetries of a regular w-sided polygon, 
is In. Find the cycle index of D„. 

10-14. Suppose that we are to make necklaces with four beads—some blue and some 
green. How many distinct necklaces are possible? Two necklaces are considered 
indistinguishable if one can be made identical to the other by any combination of 
rotation and flipping. [Hint: Use Z{D 4 ) and follow the procedure of example 1.] 

10-15. Find the number of different ways of painting the four faces of a pyramid with 
two colors. 

10-16. Find the counting series for unlabeled, simple, connected graphs with exactly one 
circuit. [Hint: Use Z(D„) and consider the graph as consisting of a single circuit 
with one or more trees attached to its vertices.] 

10-17. Find the counting series for the structural isomers of saturated alcohols 
C„H 2 b+iOH. {Hint: Consider the compound as an rt-vertex rooted tree in which 
each vertex is a carbon atom. The carbon atom carrying the OH radical corre¬ 
sponds to the root. Then find the counting series for unlabeled, rooted trees in 
which the root is at most of degree three and the nonroot vertices are at most 
of degree four.) 
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10-18. A permutation n applied on the vertex set V of a graph G is called an automorphism 
of G, if 7t preserves the adjacency. That is, an automorphism of G is an isomor¬ 
phism with itself. Prove that the set of all automorphisms Q(G) on G forms a 
group. {Hint: This group will obviously be a subgroup of S„. Use the result of 
Problem 10-4, after observing that an automorphism followed by another is also 
an automorphism.) 

10-19. Find the automorphism group Cl(G) of a graph G if G is (a) a complete graph 
of n vertices, and (b) a circuit with n vertices. Find a graph with minimum number 
of vertices n > 1 in which Q((7) consists of only the identity permutation. 

10-20. Prove that the number of ways an unlabeled n-vertex graph can be labeled is 
n\/\Q{G)\, where |Q(G)| is the order of the automorphism group Q(<7) of G. 
{Hint: The problem requires some additional knowledge of group theory. The 
proof can be found on page 180 in [1-5].) 




GRAPH-THEORETIC ALGORITHMS 
AND COMPUTER PROGRAMS 


To be able to use a digital computer in solving graph-theoretic problems 
is undoubtedly an important part of learning graph theory, especially for those 
interested in applications. Most of the practical problems which call for graph 
theory involve large graphs—graphs that are virtually impossible for hand 
computation. In fact, one of the reasons for the recent growth of interest in 
graph theory has been the arrival of the high-speed electronic computer. 
Problems that hitherto were of academic interest only are suddenly being 
solved by the computer, and their solutions are applied to practical situations. 
Computer programs have been written to handle successfully large graphs 
encountered in PERT, flow problems, transportation networks, electrical 
networks, circuit layouts, and the like. 

We must hasten to add, however, that although our computers are very 
fast and operate at nanosecond (10“ 9 second) speeds, they quickly reach their 
limit if used as a brute force to solve graph-theory problems (in fact, any 
combinatorial problem). Consider, for example, the problem of finding a 
lowest-weight Hamiltonian circuit in a weighted complete graph of n vertices, 
that is, the traveling salesman problem. There are — 1)! different 
Hamiltonian circuits. One may be tempted to use brute force and generate all 
Hamiltonian circuits and compare their weights. For a graph with 10 vertices, 
the number of Hamiltonian circuits is ^*9! = 181,440, and this method may 
be all right. But for a graph of 20 vertices, we have 

i-(/i - 1)! = 19! ~ 6 x 10 16 , 

and to perform ^*19! operations at the rate of even one operation per nano¬ 
second would require about 

6 x 10 16 _ ~ 

10 9 x 3 x 10 7 2 yearS ‘ 
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Thus it is amply clear that without the aid of mathematical tools one cannot 
hope to get the desired numerical answer, regardless of the speed of the elec¬ 
tronic computer. The power of the computer must be combined with the 
ingenuity of mathematical techniques. 

As is the case with all combinatorial problems, the manipulation and 
analysis of graphs and subgraphs is essentially nonnumerical. That is, in 
graph-theoretic programs it is primarily the decision-making ability of the 
computer that is used rather than its ability to perform arithmetic operations. 

In this chapter it is assumed that the reader has some familiarity with 
computer programming. 

11-1. ALGORITHMS 

An algorithm is, in essence, a recipe for solving a certain mathematical 
problem. It consists of a set of instructions that when followed step by step 
will lead to the solutions of.the problem. Every step in an algorithm must be 
precisely and unambiguously defined, and an algorithm must terminate after 
having solved the given problem in a finite number of steps. As pointed out 
by Knuth [11-39], page 4, every algorithm must have five important features: 
finiteness, definiteness, input, output, and effectiveness. 

An algorithm can be expressed in different forms: (1) the steps may be 
written in English; (2) it may be in the form of a computer program written 
in complete detail in the language understandable by the machine in use; or 
(3) the algorithm may be expressed in a form between these two extremes, 
such as a flow chart. Each form has certain advantages and shortcomings. 
Usually, an algorithm is first expressed in ordinary language, then converted 
into a flow chart, and finally written in the detailed and precise language so 
that a machine can execute it. 

For our purpose the flow chart is the best. It is the most popular form of 
expressing an algorithm. It is independent of the programming language and 
of the computer the student may have at his disposal. As examples of actual 
programs, listings of several tested programs are provided at the end of this 
chapter. One of the programs is in APL (A Programming Language), and the 
others are in FORTRAN. 

Efficiency of Algorithms: An algorithm must not only do what it is sup¬ 
posed to do, but must do it efficiently. The two main criteria for efficiency of 
an algorithm are the memory and computation-time requirements as a func¬ 
tion of the size of the input. In our case the input is a graph, and its size is 
the number of vertices, n, and the number of edges, e. For most graph prob¬ 
lems the memory requirement is generally not the bottleneck, but the com¬ 
putation time can be (as we saw in the opening remarks of this chapter). 

In evaluating the figure of merit of an algorithm one may seek the “worst- 
case” execution time (i.e., the time taken for the worst possible choice of a 
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graph of the given size), or the “best-case” execution time or the “average- 
case.” 

Often, more than one algorithm is available for one graph-theoretic 
problem. Sometimes one algorithm can easily be seen to be more efficient 
than others, for all nontrivial graphs. In many cases, however, the relative 
efficiencies can be compared only in the context of the size and structure of 
the graph, detailed implementation of the algorithms, and the computer used. 

A detailed analysis of the performance of a graph-theoretic algorithm is 
extremely involved. We will not indulge in such analyses, as that would re¬ 
quire a chapter in itself. We will, however, make some gross observations on 
complexities of the algorithms; namely how the computation time grows as 
a function of n or e, as n and e become very large, assuming that the worst- 
case graph is provided as the input. Such an index of performance, too un¬ 
realistic to be useful in estimating the expected computation time of a 
program, is often valuable in classifying algorithms and in their theoretical 
studies. 

11-2. INPUT: COMPUTER REPRESENTATION 
OF A GRAPH 

An algorithm has some inputs—the data with which the algorithm begins 
(just as a recipe for a dish calls for raw ingredients). Naturally, the input for 
our algorithms here will be one or more graphs (or digraphs). A graph is 
generally presented to and is stored in a digital computer in one of the follow¬ 
ing five forms. Each has advantages and disadvantages. The choice depends 
on the graph, the problem, the language, the type of machine, and whether 
or not the graph is modified during the course of the computation. 

(a) Adjacency Matrix: The most popular form in which a graph or digraph 
is fed to a computer is its adjacency matrix. For example, algorithms de¬ 
scribed in [11-25] and [11-47] use the adjacency matrix. After assigning a dis¬ 
tinct number to each of the n vertices of the given graph (or digraph) (7, the 
n by n binary matrix X((7) is used for representing G during input, storage, 
and output. Since each of the n 2 entries is either a 0 or a 1, the adjacency 
matrix requires n 2 bits of computer memory. Bits can be packed into words. 
Let w be the word length (i.e., the number of bits in a computer word) and n 
be the number of vertices in the graph. Then each row of the adjacency ma¬ 
trix may be written as a sequence of n bits in \n/w] machine words, (fx] denotes 
the smallest integer not less than x.) The number of words required to store 
the adjacency matrix is, therefore, n\n/w]. 

The adjacency matrix of an undirected graph is symmetric, and therefore 
storing only the upper triangle is sufficient. This requires only n(n — l)/2 
bits of storage. This saving in storage, however, often costs in increased com¬ 
plexity and computation time. In some problems it is worth it. 
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It must be kept in mind that the adjacency matrix is defined for graphs 
without parallel edges. As discussed in Chapter 7, it is not possible to repre¬ 
sent parallel edges in an adjacency matrix. 

(b) Incidence Matrix: Occasionally, an incidence matrix is also used for 
storing and manipulation of a graph. The algorithm in [11-68], for example, 
uses the incidence matrix A (G). An incidence matrix requires n-e bits of 
storage, which might be more than the n 2 bits needed for an adjacency matrix, 
because the number of edges e is usually greater than the number of vertices 
n. On rare occasions it may be advantageous to use the incidence matrix ra¬ 
ther than the adjacency matrix, in spite of the increased requirements in 
storage. Incidence matrices are particularly favored for electrical networks 
and switching networks. 

(c) Edge Listing: Another representation often used is to list all edges of 
the graph as vertex pairs, having numbered the n vertices in some arbitrary 
order. For example, the digraph in Fig. 11-1 would appear as a set of the 
following ordered pairs: (1,2), (2, 1), (2, 4), (3, 2), (3, 3), (3, 4), (4, 1), (4,1), 
(5,2). Had this graph been undirected, we would simply ignore the ordering in 
each vertex pair. 

Clearly, parallel edges and self-loops can be included in this representation 
of a graph or digraph. 

The number of bits required to label (1 through n) each vertex is b, where 

2 b ~' <n< 2 b . 

And since each of the e edges requires storing two such numbers, the total 
storage required is 

2 e-b bits. 

Comparing this with n 2 , we see that this representation is more economical 
than the adjacency matrix if 

2 e-b < n 2 . 

In other words, for a graph whose adjacency matrix is sparset, edge listing 
is a more efficient method of storing the graph. 

Edge listing is a very convenient form for inputting a graph into the com¬ 
puter, but the storage, retrieval, and manipulation of the graph within the 
computer become quite difficult. For example, extensive search techniques 
would be required for finding out whether or not a graph is connected 
(Algorithm 1 in Section 11-4). 

(d) Two Linear Arrays: A slight variation of edge listing is to represent 
the graph by two linear arrays, say F = (/,,/ 2 ,... ,f e ) and H = (/?,, h 2 , 

tA matrix that contains many zero elements is called a sparse matrix. A sparse adjacency 
matrix implies a small e/n ratio. 
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Fig. 11-1 A digraph. 

h e ). Each entry in these arrays is a vertex label. The ith edge e t is from vertex 
ft to vertex h t if G is a digraph. (If G is undirected, just consider e { as between 
fi and h r ) For example, the digraph in Fig. 11-1 would be represented by the 
two arrays 


F=( 5, 2,1,3, 2, 4, 4, 3, 3), 

H = ( 2, 1,2, 2, 4, 1, 1,4, 3). 

This representation, which was used in the algorithm in [11-58], lends itself 
to convenient sorting in weighted graphs. The storage requirements are the 
same as in (c). 

(e) Successor Listing: Another efficient method used frequently for graphs 
in which the ratio eln is not large is by means of n linear arrays. After 
assigning the vertices, in any order, the numbers 1, 2,...,«, we represent 
each vertex A: by a linear array, whose first element is k and whose remaining 
elements are the vertices that are immediate successors of k, that is, the vertices 
which have a directed path of length one from k. (In an undirected graph 
these are simply vertices adjacent to k.) The five-vertex digraph in Fig. 11-1 
will appear as follows in this representation. 

1:2 
2: 1,4 
3:2, 3, 4 
4:1, 1 
5:2 


For an undirected graph the neighbors (rather than the successors) of 
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every vertex are listed. Therefore, each edge appears twice—an obvious 
redundancy. 

To compare its storage efficiency with that of the adjacency matrix, let 
</ av be the average degree (out-degrees in the case of a digraph) of the vertices 
in the graph. Assuming that one computer word is needed for the label of 
each vertex, the total storage requirement for an n vertex graph is n{\ + d iV ) 
words. Thus the successor listing is more efficient than the adjacency matrix 
if 


w being the word length. 

The successor or neighbor listing form is extremely convenient for path¬ 
finding algorithms, and for a depth-first search on the graph [11-61]. 

You must have observed that the foregoing methods of graph representa¬ 
tion are not entirely different. In fact, they are necessarily related in that they 
convey the same information. Simple programs can be written to convert one 
form into another (Problem 11-2). Additional variations in these representa¬ 
tions can be made to suit the requirements at hand. For instance, a weighted 
graph can be represented by an n by n weight matrix (also called cost matrix 
or distance matrix), which is like the adjacency matrix except that instead of 
l’s the weights of the edges appear as the entries in the matrix. It should, 
however, be kept in mind that in many problems the efficiency of the algo¬ 
rithm may depend on the form in which the graph is presented. Thus the 
proper choice of the data structure is important. 


11-3. OUTPUT 

Every algorithm has an output—the cooked dish from the recipe. Unlike 
the input, which is one or more graphs, the output will vary from problem to 
problem. If the output consists of subgraphs, we may make the program 
print the appropriate adjacency matrices. On the other hand, if an output is, 
for instance, a yes or no to the question of planarity of a given graph, we may 
ask the program to simply print YES or NO. In addition, if the answer is 
YES, we may choose to get the planar representation of the graph; or if the 
answer is NO, we may ask for the thickness of the graph. For a shortest-path 
algorithm, we may simply wish to print the distance (shortest) between a pair 
of specified vertices * and y. Or one may desire to output a sequence of edges 
(or vertices) which describes a shortest path between x and y. And so forth. 
The outputs are as varied as the algorithms. 

Let us now proceed with some specific algorithms. 
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11-4. SOME BASIC ALGORITHMS 
Algorithm 1: Connectedness and Components 

The first questions one is most likely to ask when encountering a new 
graph G will be: Is G connected? If G is not connected, what are the com¬ 
ponents of G? Therefore, our first algorithm will be one that determines the 
connectedness and components of a given graph. 

In addition to being an important question in its own right, the question 
of connectedness and components arises in many other algorithms. For 
example, before testing a graph G for separability, planarity, or isomorphism 
with another graph, it may be better for the sake of efficiency to determine 
the components of G and then subject each component to the desired scrutiny. 
The connectedness algorithm is very basic and may serve as a subroutine in 
more involved graph-theoretic algorithms. (The reader may be reminded here 
that although in drawing a graph one might see whether a graph is connected 
or not, the connectedness is by no means obvious to a computer or human 
being if the graph is presented in other forms, such as those discussed in Sec¬ 
tion 11-2.) 

Given the adjacency matrix X of a graph, it is possible to determine 
whether or not the graph is connected by trying various permutations of rows 
and the corresponding columns of X, and then checking if it is in a block- 
diagonal form. (See observation 5 in Section 7-9.) This, however, is an ineffi¬ 
cient method, because it may involve n ! permutations. A more efficient method 
would be to use Corollary b of Theorem 7-8, and check for zeros in the matrix 

Y = X + XH-+ X n >. 

This too is not very efficient, as it involves a large number of matrix mul¬ 
tiplications. The following is an efficient algorithm: 

Description of the Algorithm: The basic step in this algorithm is the fusion 
of adjacent vertices (recall Section 2-7). We start with some vertex in the graph 
and fuse all vertices that are adjacent to it. Then we take the fused vertex and 
again fuse with it all those vertices that are adjacent to it now. This process 
of fusion is repeated until no more vertices can be fused. This indicates that 
a connected component has been “fused” to a single vertex. If this exhausts 
every vertex in the graph, the graph is connected. Otherwise, we start with 
a new vertex (in a different component) and continue the fusing operation. 

In the adjacency matrix the fusion of the yth vertex to the zth vertex is 
accomplished by OR-ing, that is, logically adding the jth row to the zth row 
as well as the j'th column to the zth column. (Remember that in logical adding 
14-0 = 0+1 = 1 + 1 = 1 and 0 + 0 = 0.) Then the y'th row and the yth 
column are discarded from the matrix. (If it is difficult or time consuming to 
discard the specified rows and columns, one may leave these rows and columns 
in the matrix, taking care that they are not considered again in any fusion.) 
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Note that a self-loop resulting from a fusion appears as a 1 in the main 
diagonal, but parallel edges are automatically replaced by a single edge be¬ 
cause of the logical addition (or OR-ing) operation. These, of course, have 
no effect on the connectedness of a graph. 

The maximum number of fusions that may have to be performed in this 
algorithm is n — 1, n being the number of vertices. And since in each fusion 
one performs at most n logical additions, the upper bound on the execution 
time is proportional to n(n — 1). 



Fig. 11-2 Algorithm 1: Components of G, 
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A proper choice of the initial vertex (to which adjacent vertices are fused) 
in each component would improve the efficiency, provided one did not pay 
too much of a price for selecting the vertex itself (see Problem 11-6). 

A flow chart of the “Connectedness and Components Algorithm” is 
shown in Fig. 11-2. 

A complete computer program, ready to be executed, written in APL\360, 
together with the legend identifying the variables used in the program, is 
given at the end of the chapter. Note that the program selects a vertex with 
maximum degree as the initial vertex in each component. 

To illustrate the program, an input and the resulting output are shown as 
follows: The input is a 20 by 20 adjacency matrix representing a 20-vertex 
graph, and the output is a list of components (COMP) followed by the names 
of vertices (VERT) included in each component. Vertex i corresponds to the 
ith row and column in X. 

A slightly modified program (without selecting vertices of maximum 

INPUT 
N = 20 

01 1 01 000001 000000000 
1 01 01 000001 000000000 
1 1 001 000000000000000 
00000000000000000000 
1 1 1 00000001 000000000 
0000001 1 01 000001001 1 
00000101010000000011 
000001 1 001 000000001 0 
000000000001 10001000 
000001 1 1 00000001001 1 
1 1 001 000000000000000 
000000001 00000001 000 
000000001 00000000000 
000000000000001 00000 
00000000000001 000000 
000001 0001 000000001 1 
000000001 001 00000000 
00000000000000000000 
00000111010000010001 
000001 1 001 000001 001 0 

OUTPUT 

COMP 1; VERT: 6 7 8 10 16 19 20 

COMP 2; VERT: 1 2 3 5 11 

COMP 3; VERT: 9 12 13 17 

COMP 4; VERT: 14 15 

COMP 5; VERT: 4 

COMP 6; VERT: 18 
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degree as the initial vertices and without discarding the rows and columns 
after they are OR-ed with the initial row) took 35 FORTRAN statements to 
write. The execution time of this FORTRAN program for a typical 50-vertex 
graph (with varying number of edges and components) on the IBM 7044 was 
second. 

Algorithm 2: A Spanning Tree 

Perhaps the best known and most frequently used algorithms in graph 
theory are the spanning-tree algorithms. In its simplest form a spanning-tree 
algorithm yields one spanning tree in a given connected graph. If the graph is 
disconnected, the algorithm should produce a spanning forest containing 
n — p edges, where p > 1 is the number of components in the disconnected 
graph. Clearly then, as a by-product of such an algorithm, we can find out 
whether or not the graph is connected, and if the graph is disconnected, its 
components can be identified. In fact, sometimes a spanning-tree algorithm 
is used for testing connectedness of a graph. If, on the other hand, the given 
graph has a weight or distance associated with each edge (weighted graph), 
we may wish to find a spanning tree with smallest possible weight. The signi¬ 
ficance of an algorithm for such a tree (called minimal or shortest spanning 
tree) was discussed in Section 3-10. A spanning tree is also needed for obtain¬ 
ing a fundamental set of circuits. As we saw in Section 3-9, some algorithms 
for generation of all spanning trees (a much more difficult task) in a given 
connected graph G also start by first obtaining one spanning tree. 

Description of the Algorithm: Let the given undirected self-loop-free (if 
the graph has any self-loops, they may be discarded) graph G contain n ver¬ 
tices and e edges. Let the vertices be labeled 1,2,...,«, and the graph be 
described by two linear arrays F and H [i.e., in the form (d) of Section 11-2] 
such that f e F and h { £ H are the end vertices of the ith edge in G. 

At each stage in the algorithm a new edge is tested to see if either or both 
of its end vertices appear in any tree formed so far.t At the Arth stage, 1 < k < 
e, in examining the edge ( f k , h k ) five different conditions may arise: 

1. If neither vertex f k nor h k is included in any of the trees constructed so 
far in G, the Arth edge is named as a new tree and its end vertices f k , h k are 
given the component number c, after incrementing the value of c by 1. 

2. If vertex f k is in some tree T { (i = 1,2,..., c) and h k in tree T } (j — 1, 
2,..., c, and i ^ j), the Arth edge is used to join these two trees; there¬ 
fore, every vertex in T. is now given the component number of T r The 
value of c is decremented by 1. 

3. If both vertices are in the same tree, the edge ( f k , h k ) forms a funda¬ 
mental circuit and is not considered any further. 

tlnitially there is no tree formed. The very first edge considered will always 

occur in a spanning tree (or forest). Thus the spanning tree (or forest) generated by this 
algorithm is very much dependent on the ordering of the edges. 
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4. If vertex f k is in a tree T t and h k is in no tree, the edge (f k , h k ) is added 
to T, by assigning the component number of T. to h k also. 

5. If vertex f k is in no tree and h k is in a tree T p the edge (f k , h k ) is added 
to Tj by assigning the component number of Tj to f k also. 

These five cases are marked by circled numbers in the flow chart of the 
algorithm shown in Fig. 11-3. 

The efficiency of a computer program based on this algorithm depends 
mainly on the speed with which we can test whether or not the end vertices of 
the edge under consideration have occurred in any tree formed so far. For 
this testing, we maintain a linear array (called VERTEX in the program list¬ 
ing) of size n. When an edge (i,j) is included in the cth tree, the /th and /th 
entries in this array are set to c. Subsequently, when another edge (f k , h k ) is 
examined, it is only necessary to check if the f k th and the h k th entries in array 
VERTEX are nonzero. A zero in the qth position in the array indicates that 
the vertex q has not so far been included in any tree. At the end of the execu¬ 
tion, this array VERTEX identifies the components of the graph. 

Unlike a component, a tree cannot be described by a set of vertices alone. 
Therefore, we must have an array of edges as the output. Let this linear array 
be called EDGE. If the /cth edge (in the original order in which the edges 
were placed) is in the cth tree, EDGE(Zc) = c; otherwise, it is zero. All zero 
entries in array EDGE correspond to the chords (i.e., the edges not included 
in the spanning tree or forest). This array, together with arrays F and H, 
uniquely identifies the spanning tree (or forest) generated by this algorithm. 

In this algorithm the main loop is executed e times (e being the number of 
edges). The time required to test whether or not the end vertices have appear¬ 
ed in any tree is constant—independent of both e and n. Thus the time bound 
for the execution of the algorithm is proportional to e. t In case the ratio e/n 
is high, execution time can be reduced by introducing a new variable to keep 
count of the edges included in the tree. When this variable reaches the value 
of n — 1, the program would terminate (only if the graph is connected; 
otherwise, we must examine every edge). 

A ready-to-be-executed program in FORTRAN language, based on this 
algorithm, is given at the end of this chapter. For an ALGOL listing of the 
same program, see [11-58]. A randomly generated graph of 50 vertices took 

second on the IBM 7044, using FORTRAN IV. 

Minimal-Spanning-Tree Algorithms: As discussed in Section 3-10, we can 
use the algorithm suggested by Kruskal to find a shortest spanning tree in a 
graph G in which every edge has a distance (or weight) associated with it. 
This can be accomplished with the algorithm just described. The only addi- 

tThe time required for merging two partial trees (T v T^) as implemented in the 
FORTRAN program is not independent of n. There are, however, very efficient set¬ 
merging algorithms available which almost accomplish this. 




Fig. 11-3 Algorithm 2: Spanning tree/forest. 

tional work required is to first sort the edges in a nondecreasing order of their 
weights, before representing them by F and H arrays. That is, the following 
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set of inequalities must be satisfied: 

wt of edge (/„ h.) < wt of (f l+l , h i+ ,), for all 1 < / < e — 1. 

Because of the sorting involved, Kruskal’s algorithm is not as efficient as 
the one due to Prim [3-10] (which was also discovered independently by 
Dijkstra [11-16]). The latter algorithm, as outlined in Sec. 3-10, requires no 
sorting of edges, but builds up a minimal spanning tree by successively con¬ 
necting the partially formed tree to its nearest neighbor. For a FORTRAN 
listing of an efficient implementation of Prim’s minimal-spanning-tree 
algorithm, see [11-70]. The computation time of this algorithm is proportional 
to n 2 , n being the number of vertices, [11-22]. 

Minimal spanning tree has been found quite useful in providing a lower 
bound on the length of the traveling salesman’s route [11-28]. 

Spanning Trees with Desired Properties: Instead of a shortest spanning 
tree, one may wish to find a longest spanning tree. Or one may be interested 
in trees with other desired properties and constraints, such as a spanning tree 
with a specified maximum degree or diameter. Algorithm 2 with appro¬ 
priate additional sorting or testing can be used for such purposes also. 

Generating All Spanning Trees: As we shall see in Chapter 13, analysis of 
electrical networks basically reduces to finding all spanning trees in graphs. 
Because of this important application, more than a dozen different algorithms 
for generation of all spanning trees have been proposed. In Section 3-9 we 
discussed one of these methods, the method of cyclic interchange. Since the 
number of spanning trees even in a small graph is very large, the efficiency of 
these algorithms is of paramount importance. A survey of these methods was 
done by Chase in his Ph.D. thesis [11-8]. He concludes that the most efficient 
algorithm is of the type suggested by Minty [11-43], which essentially consists 
of successively reducing a graph by operations of deletion of an edge and 
fusion of its end vertices. From the spanning trees of reduced graphs (which 
are much smaller) the spanning trees of the original graph are obtained. To 
ensure that the algorithm terminates, graphs below a certain size are not 
reduced any further; instead their spanning trees are obtained directly. A 
compact ALGOL program based on this method is given in [11-42]. 

Algorithm 3: A Set of Fundamental Circuits 

Sometimes we are required to find a set of fundamental circuits in a given 
graph. The spanning-tree algorithm just described can be used for generating 
fundamental circuits if the following additional work is performed: 

While examining the k\h edge (f k , h k ) in Algorithm 2, if condition 3 arises 
(i.e., both vertices f k and h k occur in the same tree T ,.), then instead of simply 
rejecting this edge we must find those edges in T t that form the path between 
f k and h k . This path and the edge (f k , h k ) constitute a fundamental circuit. 
Finding this path is the main problem here. In [11-52] a tree-felling procedure 
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has been suggested, where the edge ( f k ,h k ) is added to T n and all pendant 
vertices of the resulting graph are deleted iteratively. This method, however, 
turns out to be inefficient. More efficient methods have been proposed by 
Welch [11-68], Gottlieb and Corniel [11-25], and Paton [11-47]. Among these 
three, Paton’s algorithm appears to be the most efficient, and is as follows: 

Description of the Algorithm: Here also each edge is tested to see if it 
forms a circuit with the tree constructed so far; but instead of taking the 
edges themselves in an arbitrary order (as was done in Algorithm 2), we 
select a vertex z and examine this vertex by looking at every edge incident on 
z. (Vertex z, as we shall shortly see, is the vertex added most recently to the 
partially formed tree.) Let the vertices of the given connected graph G = 
(V, E) be labeled 1,2,...,«, and the graph be given by its adjacency matrix 
X. Let The the current set of vertices in the partially formed tree, and let W 
be the set of vertices that are yet to be examined (i.e., those vertices, in T as 
well as not in T, which have one or more unexamined edges incident on them). 
Initially, 7=0 and W = K, the entire set of vertices. 

We start the algorithm by setting T = 1, the first vertex, and W = V. 
Vertex 1 will be regarded as the root of the tree to be formed. After initializa¬ 
tion, the following procedure is used: 

1. If T n W = o, then the algorithm is terminated. 

2. If T n W ^ 0 , choose a vertex z in T n W. 

3. Examine z by considering every edge incident on z. If there is no such 
edge left, remove z from W, and go to step 1. 

4. If there is such an edge (z, p), test if vertex p is in T. 

5. If p e T, find the fundamental circuit consisting of edge (z, p) together 
with the unique path from z to p in the tree (formed so far). Delete 
edge (z, p) from the graph, and go to step 3. 

6. If p ^ T, add edge (z, p) to the tree and vertex p to set T. Delete edge 
(z, p) from the graph, and go to step 3. 

As mentioned earlier, the only tricky part in this algorithm is in step 5. 
How do we find the unique path from z to p in the tree? The following pro¬ 
cedure provides an answer: 

We maintain a pushdown list (a stack) TW — T n W, which stores 
those vertices in the tree that have not yet been examined. The most recently 
added vertex is at the top of the stack. Each time a vertex is taken for exami¬ 
nation it is taken from the top of this stack, and is removed from the stack. 
Two linear arrays of length n are employed: LEVEL(/) being the distance of 
vertex i from the root of the spanning tree (i.e., vertex 1), and PRED(/) being 
a vertex v such that (/', v) is an edge in the tree with v nearer the root. In 
other words, PRED(/) is the predecessor of / in the path from the root to /. 
LEVEL(/) =-- — 1 if and only if vertex / is not in set T, the current set of tree 
vertices. Initially, LEVEL(l) is set to0 and LEVEL(/) to — 1 for / = 2, 3,..., n. 
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In step 5 vertex z is under examination and an edge (z, p) has been found 
such that vertex p e T. To find the fundamental circuit formed by (z, p) with 
the tree, we trace the unique path from z to p in the tree by successively find¬ 
ing the predecessors PRED(z), PRED(PRED(z)), . . . , till we encounter 
PRE D(p), the predecessor of p. In other words, as shown in Fig. 11-4, the 


(Unexamined vertex in T ) 


1 (Root) 



i 



fundamental circuit generated is 

z, PRED(z), PRED(PRED(z)), . . . , PRED (p),p, z. 

The most important thing to note is that the predecessor PRED(A:) of 
every vertex k in T is a vertex which is either already examined or is being 
examined. That is, if k e T n W, then 

PRED(Ar) £ W but PRED(Ar) e T. 

A flow chart of the algorithm is given in Fig. 11-5, and a ready-to-be- 
executed FORTRAN program is provided at the end of the chapter. 

The execution time is bounded by n\ 2 < v < 3, and the value of v de¬ 
pends on the structure of the graph and also on the labeling of vertices [11-47]. 



Read 

: n, X 


< 

— 


Initialize: 
TW <- 1 
LEVEL *— 1 
LEVEL (1)^0 



z <- top of TW 
pop TW 



Pick an edge 

(z,p) 

(i.e. x zp = \ ) 



Add (z, p) to tree 
push p in TW 
PRED (p) +- z 

LEVEL (p) <- LEVEL (z) + 1 


x zp -0 
x pz <-0 


i _ 

Output the 

fundamental 

circuit 

z, PRED (z), 
PRED (PRED (z)), 


PRED ( p),p, z 



Fig. 11-5 Algorithm 3: Fundamental circuits. 
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Although for simplicity we assumed that the graph is connected, the 
algorithm will work for disconnected graphs also. First, it will produce all 
fundamental circuits in the component containing the starting vertex 1. 
After having exhausted the first component, we select a vertex y such that 
LEVELfy) = — 1, and start with y as the root of a spanning tree in the second 
component. This procedure continues till there is no vertex left with — 1 as 
its LEVEL. 

Typically for a graph of 50 vertices and 132 edges the IBM 7044 (using 
FORTRAN IV) took second to generate a set of fundamental circuits. 

All Circuits in an Undirected Graph: All circuits of an undirected graph 
might be found by first forming the set of all linear combinations (i.e., ring 
sums) of the fundamental circuits and then discarding from this set all those 
circuit sets that contain other circuits. (This is because a linear combination 
of circuits can be either a circuit or a union of edge-disjoint circuits. And a 
union of edge-disjoint circuits contains other circuits.) Such an algorithm, 
however, would be very inefficient. From ju fundamental circuits 2 M — p — 1 
linear combinations must be made, and then each of these must be compared 
pairwise with every other to test for containment. Therefore, a different 
approach has to be taken. 

To reduce the storage requirement and the number of comparisons, one 
may be tempted to suggest discarding every edge-disjoint union of circuits as 
soon as it is generated. This approach, however, is faulty. For we might find 
that a genuine circuit was a combination of some discarded circuit set and 
another circuit generated later. 

Welch [11-68] proposed a scheme of ordering the fundamental circuits so 
that one could discard a union of edge-disjoint circuits as it is produced before 
generating all 2^ — // — 1 combinations and then make pairwise compari¬ 
sons. In a more recent paper [11-24], Gibbs has pointed out an error in 
Welch’s algorithm and has proposed a modification. Gibbs’s algorithm for 
generation of all circuits from a set of fundamental circuits is essentially an 
exhaustive method and requires storage proportional to 2^. Finding an effi¬ 
cient algorithm for identifying all circuits in a graph is an open problem. For 
a survey of circuit-generation algorithms see Prabhaker [11-50]. 

Algorithm 4: Cut-Vertices and Separability 

Having found out that a graph G is connected, the next question one is 
most likely to ask is: Is the graph G separable? That is, is there one or more 
cut-vertices in <7? If the answer is yes, one would like to find the cut-vertices 
and the blocks (maximal nonseparable subgraphs) of G. 

As pointed out in Section 4-5, cut-vertices are important in the study of 
vulnerability of a communication network. Moreover, this algorithm may serve 
as a subroutine for other algorithms, such as for planarity and isomorphism. 



SEC. 11-4 


SOME BASIC ALGORITHMS 


285 


Preliminary Simplification: In this algorithm, as in most others, it pays to 
perform some preliminary simplification. If the given graph has the possibil¬ 
ity of being disconnected, we could apply Algorithm 1 and consider each 
component as a connected graph. It will be a waste of computer memory and 
execution time to drag along all the components of a disconnected graph. 
Similarly, if the graph is not simple, we can immediately discard all the self¬ 
loops and parallel edges, since their presence or absence has no effect on 
separability. Third, if the graph has any pendant vertices, we can prune the 
graph by repeatedly deleting pendant vertices. (In the pruning process we 
must keep in mind that every vertex adjacent to a pendant vertex is a cut- 
vertex, except in the trivial case where the graph consists of just one edge.) 
Usually these simplifications will have substantially reduced the size of the 
original graph.t 

A straightforward method (which was used in [5-8], for example) for 
testing separability of a graph would be to remove each vertex in turn (by 
deleting the corresponding row and column from its adjacency matrix X) and 
then to test the resulting graph for connectedness, using Algorithm 1. But this 
is an inefficient method, and we can do better using a different approach, 
suggested by Read [11-53]. 

Let us recall a result from Chapter 4: Two edges are in the same block if 
and only if there exists at least one circuit that contains both these edges 
(Problem 4-10). At first sight it may appear that to use this characterization 
of a block we would have to generate all circuits—an obviously time- and 
storage-consuming affair. The following two results, however, reveal that it 
would suffice to generate only a set of fundamental circuits: 

Lemma 1 

A nonempty intersection of two fundamental circuits in a graph is always a 
path. 

Proof: With respect to some specified spanning tree T, let e x and e 2 be two 
chords forming fundamental circuits /, and f 2 , respectively. Then if /, n f 2 con¬ 
tains two edges, x and y, not connected in/j n f 2 , there is a path P x in f x between 
x and y (that is, a path between one of the end vertices of edge x and one of 
the end-vertices of edge;;); and this path does not contain chord e x . Similarly, 
there is a path P 2 in f 2 between x and y that does contain chord e 2 . Then the 
subgraph 

P X \J P 2 \J [x, y} 

contains a circuit without containing any chord, which is impossible. ■ 

|A preliminary simplification is to be performed only if it produces a net saving in 
running time. 
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Lemma 2 

In a graph G if edges a and b belong to a fundamental circuit f, and if edges 
b and c belong to another fundamental circuit f } such that a f } and c f, 
then there exists some circuit T in C such that a and c both are in F. 

Proof: The proof follows from Lemma 1, because f n f) is a path containing 
b but not a or c. Therefore,/- © /,- is a circuit (not an edge-disjoint union of circuits) 
containing a and c. ■ 

Description of the Algorithm: If we generate fundamental circuits one by 
one, and as each fundamental circuit is generated we labelt (or relabel) all its 
edges identically, using the following procedure, we will have identified the 
blocks in the graph: 

Each edge in the first fundamental circuit is labeled with 2’s. When the 
second fundamental circuit is found, it will have either all its edges unlabeled, 
or some of its edges would be labeled 2. In the former case, label every edge 
of the second fundamental circuit with 3’s, and in the latter case with 2’s. 
When this process reaches the rath (1 < m < e — « + 1) fundamental cir¬ 
cuit, we may find any one of three conditions: 

1. If every edge in the mth fundamental circuit is unmarked, label all of 
them with a new integer q + 1. 

2. If some edges in the mth fundamental circuit are marked u and all 
others are unmarked, label each of the unlabeled ones as u also. 

3. Suppose that some edges in the mth fundamental circuit are marked 
u, others v, and others w, . . . , and some are unmarked. Let u < v < 
w < ... . Then relabel all edges marked v, w, . . . , in G as u, and label 
all unmarked edges in the mth fundamental circuit as u also. 

When this process terminates, after having generated fundamental circuits 
and labeled the edges in each of them, the following has been accomplished: 

Every edge that belongs to a circuit has been labeled. Moreover, any two 
edges have the same label if and only if they are together in some circuit (not 
necessarily a fundamental circuit). In other words, each set of edges carrying 
identical labels constitutes a block. If there is more than one block in the 
graph, the graph is separable. Any vertex incident on edges with different 
labels is a cut-vertex. An edge that has not acquired any label is a bridge. (A 
bridge is an edge whose removal disconnects the graph.) 

In this algorithm an edge belonging to a circuit gets relabeled many 

fThis labeling can be conveniently performed using the adjacency matrix X and by 
writing over it. The edge between the /th and /th vertices is labeled q (q = 2, 3,..., 
e — n + 1), simply by replacing x, y - and x 7 , with q(xij only, if the upper triangle is used). 
Entries that are still l’s correspond to unlabeled edges. Others will have labels 2, 3, ..., 
and so on. 
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times—an obvious source of inefficiency. An improvement suggested by 
Paton [11-48] reduces the relabeling of edges by the following devices: Instead 
of relabeling the edges in a fundamental circuit as soon as it is generated, we 
wait till a vertex z in Algorithm 3 has been completely examined, and then 
assign identical labels to the fundamental circuits (passing through z) thus 
generated. Therefore, labeling has to be performed only n times and not p 
times. Moreover, we need not label every edge in the graph (Problem 11-11). 
It is left for the reader to construct a flow chart for the block-identification 
algorithm in which Algorithm 3 is completely embedded. Remaining details 
can be found in [11-48]. 

Using an entirely different approach, Hopcroft and Tarjan [11-31] and 
Tarjan [11-61] have given an algorithm which is faster than the algorithm 
described here for certain types of graphs. Their algorithm uses depth-first 
search on the graph (to be discussed later in this chapter), and the graph is to 
be input in the successor-listing form. Its execution time is proportional to 
e, whereas the time bound for the Read-Paton algorithm described here is 
proportional to n y , where 1 < y < 2, depending on the structure of the graph. 
Analysis and extensive tests show that for a typical graph of n vertices and e 
edges, Hopcroft and Tarjan’s algorithm outperforms (on IBM 7044) Paton’s 
algorithm as long as e < 5 n. For graphs of much higher densities (Problem 
11-1) Paton’s algorithm performed better. Thus for planar graphs (since 
e < 3/? — 6), Hopcroft and Tarjan’s algorithm will in general be faster. 
Typically, for an 80-vertex 400-edge graph, the IBM 7044 took about 7 se¬ 
conds for block identification with either of the two algorithms. 

Algorithm 5: Directed Circuits 

One of the most important things about a digraph is its directed circuits 
(also called cycles). The significance of directed circuits in many applications 
was discussed in Chapter 9. Unlike the case of undirected graphs, no tech¬ 
nique is known by which we can obtain a basic set of directed circuits such 
that every directed circuit in the digraph is obtained as a linear combination of 
this basic set. Therefore, Algorithm 3 is of little help in obtaining all directed 
circuits of a digraph. We must generate every directed circuit individually. 
For this we must examine each edge (unless the edge is known a priori to 
belong to no directed circuit) many times. 

Preliminary Simplification: Although it is not necessary, in most cases the 
prior application of the following two steps will simplify a given digraph. 
First, if the digraph is likely to be disconnected, use Algorithm 1 [with slight 
modification for a digraph (Problem 11-7)] to identify the connected com¬ 
ponents, and then consider one component at a time. Second, successively 
delete all vertices (and the edges incident on them) that have zero in-degree 
or zero out-degree. Clearly, such a vertex cannot lie in any directed circuit. 
These vertices are easy to identify because they correspond to entire rows 
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[for d + {v) 0] or columns [for d~(v ) = 0] of zeros in the adjacency matrix 

X. For example, if the digraph given was the one in Fig. 9-16, edges a, b, and 
h would be eliminated. Then in the next go-round edges e and c would be 
deleted, leaving us a digraph of only three edges, d , /, and g. On the other 
hand, this method of simplification will not reduce the digraph shown in Fig. 
9-21. 

Description of the Algorithm: This algorithm, first proposed by Roberts 
and Flores [11-56] and subsequently systematized by Tiernan [11-63], uses an 
exhaustive search to find all directed circuits in a given digraph G. As usual, 
the vertices of G are assigned integers 1, 2,...,/? as their names. The algo¬ 
rithm depends on starting from a vertex p , and building a directed path P = 
(/? x , /z 2 ,... ,p k ) until no further vertices (satisfying certain conditions) are 
“available” at vertex p k . At p k , when it is not possible to extend the directed 
path any further, the algorithm checks to see if there is a directed edge from 
p k to p v If there is such an edge, a directed circuit (p if p 2 ,... ,p k ,pf has 
been found and is duly recorded. If there is no such edge in the digraphs, we 
move back one vertex to />*_, and try extending the path again from p k _ { 
along a different edge (if there is one). Whether a directed circuit is found or 
not, the algorithm makes vertex p k forbidden for the next extension from 
/?*_, (thus avoiding going over the same path). 

This process of looking for directed .circuits and then moving back a ver¬ 
tex is continued till we finally backtrack to the vertex p l itself. Thus all direct¬ 
ed paths starting from p { have been examined and directed circuits recorded. 
Starting with the next vertex, the entire process is repeated. The iteration 
starts with vertex /?, = 1 and ends with /?,=«. 

In this exhaustive search for directed paths we must take the following 
precautions: 

1. In the process of extending each directed path, going round and round 
a directed circuit must be avoided. This is achieved by insisting that 
any vertex that has already been included in the directed path is “not 
available” for extending the path. 

2. Generating a directed circuit of q vertices q times—once at each vertex 
in the circuit—must be avoided. This is accomplished by insisting that 
no vertex i < p x is available for path extension, if the path begins with 
vertex p v This rule assures that the search for a particular directed 
circuit commences only when its lowest-numbered vertex is at the path 
initiation. 

3. The same path must not be considered more than once during the path 
extension. This is accomplished by keeping an updated list of forbidden 
vertices in a binary n by n matrix H = [h u \. The 1 entries in the ith row 
correspond to the vertices that are forbidden from vertex /(i.e., if vertex 
j is forbidden from vertex i, set h ij <— 1). AO entry indicates that the 
vertex is not forbidden (i.e., if h i} = 0, then vertex j is not forbidden 
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from vertex /). Matrix H is reset to zero each time a new vertex is 
chosen as the starting vertex. 

The digraph is inputted as its adjacency matrix [see Section 11 -2(a)]. The 
vertices are labeled as usual with integers 1, 2,. . . , n. The directed path under 
consideration is represented by a linear array 

p = (Pi, P 2 * ■ ■ ■ *Pk-\*Pk* o, 0, . . . , 0, 0) 

of order n. The first vertex of every path is p { and the last one is p k . 

I 


Fig. 11-6 Digraph. 

The algorithm can be best explained with an example. When applied to 
the digraph of Fig. 11-6, the following steps will be performed: 

Path P 

p i p2 p 3 Pa. p 5 Action on P 

10 0 0 0 
1 2 0 0 0 

1 2 4 0 0 

1 2 4 3 0 

12 4 0 0 

1 2 4 5 0 

12 4 0 0 
12 0 0 0 
12 5 0 0 
1 2 0 0 0 

10 0 0 0 

2 0 0 0 0 

2 4 0 0 0 

2 4 3 0 0 

2 4 0 0 0 

2 4 5 0 0 

2 4 0 0 0 

2 0 0 0 0 

2 5 0 0 0 

2 0 0 0 0 


Report circuit: ^ 4,3 <— 1 
Report circuit: /? 4 ,s 4 — 1 

No circuit: /? 4 , t 4 — 64,2 4 — //4,3 <— h 4,4 4 — /? 4 ,s 4 — 0; hi ,a «■— 1 

Report circuit: hi,s 4 — 1 
No circuit: pi *— p\ + 1; H * — 


Report circuit: set h a ,3 4 — 1 
No circuit: set /u.s 4 — 1 

No circuit: Ha, 1 4 — /? 4,2 4 —- /? 4,3 4 — /u,4 4 — /u.s 4 — 0; hi ,a 4 — 1 

No circuit: hi,% <— 1 

No circuit: hi , } <— hi ,2 4 — hi ,3 <— /?2,4 4 — ^ 2,5 4 — 0; h\,i <— 1 
No circuit: p\ <— p\ + 1; H * — 0 



0 
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Path P (Cont.) 


Pi Pi P 3 P 4 P 5 

Action on P 

3 

0 

0 

0 

0 

Report circuit: p\ <— p\ + 1 ; H <— 0 

4 

0 

0 

0 

0 


4 

5 

0 

0 

0 

No circuit: <— 1 

4 

0 

0 

0 

0 

No circuit: p i *— p i + 1 ; H <— 0 

5 

0 

0 

0 

0 

No circuit: stop 


The flow chart of this algorithm, which is a modified version of the algo¬ 
rithm given in [11-63], is shown in Fig. 11-7. 

You must have observed that this algorithm is nothing more than a sys¬ 
tematic and exhaustive search for directed circuits. As shown in the example, 
the same directed path is traversed many times. Even a directed circuit is 
usually examined and rejected several times before its turn to be accepted 
arrives. Consequently, the algorithm is very slow, and there is room for con¬ 
siderable improvement. To quote Tiernan [11-63], this algorithm “would be 
costly to utilize on a graph containing more than 50 arcs or 7 vertices”—a 
small graph indeed. 

The algorithm could be easily modified to generate all directed Hamil¬ 
tonian circuits. This, in fact, was the original purpose of the algorithm as 
reported by Roberts and Flores [11-56]. 

A random directed graph of 20 vertices, 55 edges, and 434 directed cir¬ 
cuits took about 17 seconds on the IBM 7044. This indicates that this method, 
involving a systematic but exhaustive search, is quite inefficient in terms of 
execution time. 

A similar algorithm but somewhat more involved, considerably faster, 
but requiring more storage was proposed by Weinblatt [11-67]. See also 
[11-50]. 


11 - 5 . SHORTEST-PATH ALGORITHMS 

A large number of optimization problems are mathematically equivalent 
to finding shortest paths in a graph. Consequently, shortest-path algorithms 
have been worked over more thoroughly than any other algorithm in graph 
theory. More than 100 papers have been published and dozens of algorithms 
have been proposed. Some of these algorithms are better than others, some 
are more suited for a particular structure than others, and some are only 
minor variations of earlier algorithms. For a good comparative study of 
various shortest-path algorithms through the year 1968, a survey paper by 
Dreyfus [11-17] is highly recommended. 
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Fig. 11-7 Algorithm 5: Directed circuits. 


There are different types of shortest-path problems. Most frequently 
encountered among these are the following five, of which we shall solve the 
first three: 

1. Shortest path between two specified vertices. 

2. Shortest paths between all pairs of vertices. 
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3. Shortest paths from a specified vertex to all others. 

4. Shortest path between specified vertices that passes through specified 
vertices. 

5. The second, third, and so on, shortest paths. 

In a worst-case situation, type 1 becomes identical to 3, because (as we shall 
see shortly) in the process of finding the shortest path from a specified vertex 
to another specified vertex, we may have to determine the shortest paths to 
all other vertices. Let us deal with type 1 first. 

Algorithm 6: Shortest Path from a Specified Vertex 
to Another Specified Vertex 

The problem of finding the shortest path from a specified vertex s to 
another specified vertex t , can be stated as follows: 

A simple weighted digrapht G of n vertices is described by an n by n 
matrix D = [d u ], where 

d u — length (or distance or weight) of the directed edge from vertex i to 
vertex j, d u > 0, 

da = 0 , 

d;j — oo, if there is no edge from i to j (in carrying out a program oo is 
replaced by a large number, say 9999999). 

In general, d u ^ d jn and the triangle inequality need not be satisfied .That is, 
du + d jk may be less than d ik . [In fact, if the triangle inequality is satis¬ 
fied, for every i,j, and k, the problem would be trivial because the direct edge 
(x, y ) would be the shortest path from vertex x to vertex y.] The distance of a 
directed path P is defined to be the sum of the lengths of the edges in P. The 
problem is to find the shortest possible path and its length from a starting 
vertex 5 to a terminal vertex t. 

Among several algorithms that have been proposed for the shortest path 
between a specified vertex pair, perhaps the most efficient one is an algorithm 
due to Dijkstra [11-16]. 

Description of the Algorithm: Dijkstra's algorithm labels the vertices of 
the given digraph. At each stage in the algorithm some vertices have perma¬ 
nent labels and others temporary labels. The algorithm begins by assigning 

|If the given digraph is not simple, it can be simplified by discarding all self-loops, 
and replacing every set of parallel edges by the shortest (least-weight) edge among them. 
Also, the graph need not be directed. For an undirected graph dij = dj;, and effectively 
each undirected edge is replaced by two oppositely directed edges of the same weight. 
If the graph is not weighted, assume d t j = 1, and the adjacency matrix becomes the dis¬ 
tance matrix. 
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a permanent label 0 to the starting vertex s, and a temporary label oo to the 
remaining n — 1 vertices. From then on, in each iteration another vertex 
gets a permanent label, according to the following rules: 

1. Every vertex j that is not yet permanently labeled gets a new temporary 
label whose value is given by 

min [old label of j, (old label of i + d tJ )\, (11-1) 

where / is the latest vertex permanently labeled, in the previous itera¬ 
tion, and d;j is the direct distance between vertices / and j. If i and j are 
not joined by an edge, then d u = oo. 

2. The smallest value among all the temporary labels is found, and this 
becomes the permanent label of the corresponding vertex. In case of 
a tie, select any one of the candidates for permanent labeling. 

Steps 1 and 2 are repeated alternately until the destination vertex t gets a 
permanent label. 

The first vertex to be permanently labeled is at a distance of zero from s. 
The second vertex to get a permanent label (out of the remaining n — 1 
vertices) is the vertex closest to s. From the remaining n — 2 vertices, the 
next one to be permanently labeled is the second closest vertex to s. And so 
on. The permanent label of each vertex is the shortest distance of that vertex 
from 5 . This statement can be proved by induction (Problem 11-13). As an 
illustration of Dijkstra’s procedure, let us find the distance from vertex B to 
G in the digraph shown in Fig. 11-8. We shall use a vector of length seven to 
show the temporary and permanent labels of the vertices as we go through 
the solution. The permanent labels will be shown enclosed in a square, and 
the most recently assigned permanent label in the vector is indicated by a 
tickffl/ The labeling proceeds as follows: 


A 

*y 

c 

D 

E 

F 

G 


OO 

e 

oo 

OO 

oo 

OO 

OO * 

Starting Vertex B is labeled 0. 

7 

a 

l s 

OO 

oo 

OO 

OO \ 

All successors of B get labeled. 

7 

El 

ffl 

OO 

oo 

OO 

oo ; 

Smallest label becomes permanent. 

4 

E 

m 

OO 

5 

4 y 

oo • 

Successors of C get labeled. 

4 

E 

m 

OO 

5 

a 

oo 


4 y 

E 

m 

14 

5 

a 

11 


a 

E 

m 

14 

5 

si 

11 


ffl 

E 

m 

12 

5 y 

ffl 

11 


a 

E 

ffl 

12 

ffl 

ffl 

11 


a 

E 

ffl 

12 

a 

ffl 

7 y 


a 

E 

m 

12 

a 

si 

ffl: 

Destination vertex gets permanently 


labeled. 
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Fig. 11-8 Simple weighted digraph. 


All steps are easily programmed except for the job of distinguishing the 
permanently labeled vertices from the temporarily labeled ones, which is 
slightly tricky. An efficient method of accomplishing this is to associate 
indices 1, 2,.. . , n with the vertices, and keep a binary vector VECT of 
order n. When the ith vertex becomes permanently labeled, the ith element 
in this binary vector changes from 0 to 1. 

A flow chart of this algorithm is given in Fig. 11-9, and a FORTRAN 
listing of the program is provided at the end of the chapter. 

The algorithm described does not actually list the shortest path from the 
starting vertex to the terminal vertex; it only gives the shortest distance. The 
shortest path can be easily constructed by working backward from the ter¬ 
minal vertex such that we go to that predecessor whose label differs exactly 
by the length of the connecting edge. (A tie indicates more than one shortest 
path.) Alternatively, the shortest path can be determined by keeping a record 
of the vertices from which each vertex was labeled permanently. This record 
can be maintained by another linear array of length n, such that whenever 
a new permanent label is assigned to vertex j, the vertex from which j is di¬ 
rectly reached is recorded in the y'th position of this array. 

Remarks 

1. In this algorithm, had we continued the labeling until every vertex got 
a permanent label (rather than stopping at the permanent labeling of the 
destination vertex t ), we would have gotten an algorithm for the shortest 
paths from starting vertex s to all other vertices. A computer program for this 
purpose, written in ALGOL, is given in [11-6]. 

2. If we take a shortest path from the starting vertex s to each of the other 
vertices (which are accessible from j 1 ), then the union of these paths will be an 
arborescence T rooted at vertex s. Every path in T from s is the (unique) 
shortest path in the digraph (or graph, as the case may be). Such a tree is 
called the shortest-distance arborescence (and shortest-distance tree in an 
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undirected graph—not to be confused with the shortest spanning tree intro¬ 
duced in Chapter 3). This arborescence may be constructed as a by-product 
in Algorithm 6, if the labeling is continued till every vertex gets a permanent 
label, and if each time a vertex is labeled permanently, the corresponding edge 
is added to the arborescence. For example, the shortest-distance arborescence 
of Fig. 11-8 is given in Fig. 11-10. 



3. In this algorithm, as more vertices acquire permanent labels the num¬ 
ber of additions and comparisons needed to modify the temporary labels 
continues to decrease. In the case where every vertex gets permanently la¬ 
beled, we need n(n — l)/2 additions and 2 n(n — 1) comparisons. Thus the 
computation time is proportional to n 2 . 

4. Notice that for a given n the computation time is independent of the 
number of edges the digraph may have. This is because it is tacitly assumed 
that the digraph is complete—each missing edge is simply given a very large 
weight. This observation is also borne out by the following typical data: On 
the IBM 7044, for a random digraph of 80 vertices and 3200 edges, it took §§ 
second to find the shortest distance from a given vertex to all others. Another 
random graph with 80 vertices but only 1000 edges also took second for the 
same computation. 

5. If the digraph is sparse [i.e., the number of edges e is much smaller than 
n(n — 1)], it is possible to reduce the time of computation. This can be 
achieved by incorporating another test which alters the temporary labels of 
only those vertices that are successors of the most recent permanently labeled 
vertex. There is, of course, a trade off here between the time taken for testing 
and the time that is saved as a result of this test. An ALGOL program of 
Dijkstra’s algorithm, which takes advantage of the sparseness of the graph, is 
given in [11-38], pages 43-44. For another technique that reduces computa¬ 
tion time in sparse graphs, see remark 3 in Algorithm 7. 

6. If the given digraph G is not weighted, every edge in G has a weight of 
one, and matrix D is the same as the adjacency matrix. Then the problem is 
simpler. We perform logical operations rather than real arithmetic. 

7. We have assumed the distances d u are all nonnegative numbers. If some 
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of the distances are negative, Algorithm 6 will not work. (Negative distances 
in a network may represent costs and the positive ones profits.) The reason 
for the failure of Algorithm 6 is that once a vertex is permanently labeled its 
label cannot be altered. Shortest-path algorithms have, however, been pro¬ 
posed (see [11-17]) that will solve this problem, provided the sum of all d u 
around every directed circuit is positive. (The problem has no solution if a 
negative-weight circuit having a vertex on a directed path from s to t exists, 
because then one could continue minimizing the distance to — oo by going 
round and round this circuit.) The computation time of the existing algo¬ 
rithms that can handle negative du is n 3 and not n 2 . 

8. It was suggested by T. A. J. Nicholson that carrying the shortest-path 
algorithm simultaneously from both ends s and t would improve the speed. 
Dreyfus [11-17] has, however, shown that the double-ended procedure would 
improve the efficiency only in certain types of digraphs. In the case where 
nearly all n vertices must be permanently labeled from either one end or the 
other, the double-ended procedure is actually less efficient than Dijkstra’s 
one-ended procedure. For an ALGOL listing of Nicholson’s double-ended 
program see Algorithm 22 in [11-6]. 

Algorithm 7: Shortest Path Between All Pairs 
of Vertices 

Sometimes one is interested in finding the shortest paths between all 
n(n — 1) ordered pairs of vertices in a digraph (or n(n — l)/2 unordered 
pairs of vertices in an undirected graph). If we were to use Algorithm 6 for 
this purpose, the computation time would be proportional to « 4 . There are 
several algorithms available that can do better. Among these, two are con¬ 
sidered best, both being equally efficient. One is due to Dantzig [11-15] as 
improved by Tabourier [11-60]; the other one is due to Floyd [11-21], based 
on a procedure by Warshall [11-65]. Both algorithms require computation 
time proportional to n 3 . We shall describe the Warshall-Floyd algorithm. 

Description of the Algorithm: The algorithm works by inserting one or 
more vertices into paths, whenever it is advantageous to do so. 

Starling with the n by n matrix D = [<7 iV ] of direct distances, n different 
matrices D,, D 2 ,..., D„ are constructed sequentially. Matrix D*, 1 k ^ 
n, may be thought of as the matrix whose (/,y)th entry gives the length of the 
shortest directed path among all directed paths from i to j, with vertices 1, 
2 ,,k allowed as the intermediate vertices. Matrix = [dff] is construct¬ 
ed from according to the following rule: 

dff = min [d { i j~ i) , (d%~ l) + d ( k k f l) )\, 
k = 1,2 
d\T = d u . 


for 

and 


(H-2) 
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That is, in iteration 1, vertex 1 is inserted in the path from vertex i to j if 
dij > d iX + d Xj . In iteration 2, vertex 2 is inserted, and so on. 

Suppose, for example, that the shortest directed path from vertex 7 to 3 
is 7 4 1 9 5 3. The following replacements occur: 

Iteration 1 : d l $ is replaced by (d^ + A 0 ?)- 
Iteration 4: Al' is replaced by {d { 7 \ + All¬ 
iteration 5: A 4 3 is replaced by (d ( $\ + Adu¬ 
lteration 9: AV is replaced by (AV + A®*)- 

Once the shortest distance is obtained in A A the value of this entry will not 
be altered in subsequent operations. 

The flow chart of the algorithm is given in Fig. 11-11 and a FORTRAN 
listing is given at the end of the chapter. Its ALGOL listing can be found in 
[ 11 - 21 ]. ^ 

The algorithm described so far does not actually list the path; it only gives 
the shortest distances. Obtaining the path is slightly more involved than in 
Algorithm 6 , because now there are n(n — 1) paths required, not just one. 
An efficient method of obtaining the intermediate vertices in each of the 
shortest paths is by constructing a matrix Z = [z (7 ] (referred to as the optimal- 
policy matrix ), such that entry z u is the first vertex from / along the shortest 
path from i to j. The optimal-policy matrix Z can be constructed as follows: 
Initially we set 

z u = h if d(j ¥=■ °°, 

= 0, if d u — oo. 

In the ki\\ iteration if vertex k is inserted between i and /, element z i} is re¬ 
placed by the current value of z ik , for all i and j. This updating of the Z 
matrix is done during each iteration k, where k = 1, 2,At the end, the 
shortest path (/, v x ,v 2 ,, v q J) from i to j is derived as a sequence of vertex 
numbers from matrix Z as follows (see Problem 11-15): 

^2 z vi ,y> 


J Z v Q ,j- 




Fig. 11-11 Algorithm 7: Shortest path between every vertex-pair. 
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Remarks 

1. Notice that for computational purposes we need memory space for 
only one n by n matrix. Other constructed matrices can be overwritten on 
this matrix. 

2. To estimate the execution time, note that we have to construct n ma¬ 
trices D 15 D 2 ,. . . , D„, sequentially. For each matrix the number of ele¬ 
ments to be computed is (n — 1)(« — 2), because in Eq. (11-2) / ^ j, i ^ k, 
j ^ k (although, for simplicity, in the flow chart we have not taken advantage 
of this slight saving). Thus the execution time is proportional to n(n — 1) 
(n — 2) ~ n 3 . 

3. Whenever d ( £~ x) — oo in Eq. (11-2), it is possible to circumvent n — 1 
additions and comparisons in exchange for an additional test. This is a trade 
off just as in Algorithm 6, but since the execution time for Algorithm 7 is 
proportional to n 3 , it pays to include this extra test for almost every digraph. 
(Note that the test is not included in the flow chart.) 

4. If the graph is sparse, that is, the number of edges are far fewer than 
n(n — 1), it is possible to take advantage of the special structure and reduce 
the labor by decomposing the graph. Shortest paths are obtained in each 
subgraph and these are put together to obtain the shortest paths in the entire 
graph. As an extreme example, consider the case in which a digraph of n 
vertices consists of the two digraphs of nj 2 vertices each. The number of com¬ 
putations reduces from n 3 to 2(n/2) 3 , a reduction of 75 per cent. See Hu and 
Torres [11-33] for decomposition algorithms. 

5. As in Algorithm 6, if the digraph is unweighted, that is, D = X, the 
computational time can be reduced by replacing the arithmetic operations 
with logical operations. 

Transitive Closure of a Digraph: Let G be a simple, /r-vertex digraph. Let 
us construct another simple n-vertex digraph by adding edges to G as follows: 
Add an edge (/, j ) directed from vertex i to j if and only if there is a directed 
path (of any length 2, 31) from / to j in G. Digraph H is called the 
transitive closure of G. In other words, 

X(H) = R(G), 

where X(//) is the adjacency matrix of H and R(G) is the reachability (or 
accessibility) matrix of G. 

It is easy to see that the transitive closure of a given digraph G can be 
obtained by applying ALGORITHM 7 to the adjacency matrix X(G), i.e. by 
setting D <— X. The time taken by this method of obtaining transitive closure 
is proportional to n 3 . We can, however, do better. For discussions of more 
efficient algorithms for transitive closure see [11-45]. 
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Longest-Path Analysis: Sometimes, notably in critical path analysis of 
activity networks (see Chapter 14), one needs the longest paths (rather than 
the shortest) from a specified vertex to all others. One would expect that 
maximization procedures analogous to the minimization procedures in 
Algorithms 6 and 7 would yield the desired paths. But for an arbitrary digraph 
this will not work. For in the process of maximization, one could go round 
and round a directed circuit and the length would be made arbitrarily large. 
Another difficulty is the following: In the shortest-path problem, if (s, t, 
u ,...,/) is a shortest directed path from s to t , then the subpath ( t , «,...,/) 
is a shortest path from t to /. It is this property on which the shortest-path 
algorithms are based. On the other hand, if (s, t, u ,...,/) is a longest directed 
path from s to/, there may well exist a directed path from t to/via s that is 
longer than the subpath ( t , u,... ,f). 

Both these difficulties disappear if the given digraph is acyclic (which is 
the case for activity networks). Dijkstra’s algorithm can then be used to find 
longest paths from a given vertex to all others in an acyclic digraph. The 
details are left as an exercise (Problem 11-16). 

In addition to the three shortest-path problems dealt with in Algorithms 
6 and 7, there are several other shortest-path problems. For example, one 
may be interested in finding the second-shortest path from s to/. Or one may 
be interested in finding a shortest path from s to/that passes through certain 
specified vertices. For these and more, [11-17] and [11-3] are recommended, 
while we move on to an altogether different problem. 


11-6. DEPTH-FIRST SEARCH ON A GRAPH 

In this section we shall discuss a powerful technique of systematically 
traversing the edges of a given graph such that every edge is traversed exactly 
once and each vertex is visited at least once. This technique, called the depth- 
first search (DFS) or backtracking on a graph was first formalized and used 
by Hopcroft and Tarjan [11-31] and was subsequently studied in some depth 
by Tarjan [11-61]. 

It is evident that for answering almost any nontrivial question about a 
given graph G we must examine every edge (and in the process every vertex) 
of G at least once. For example, before declaring a graph G to be disconnected 
we must have looked at every edge in G; for otherwise, it might happen that 
the one edge we had decided to ignore could have made the graph connected. 
The same can be said for questions of separability, planarity, and the like. 

There are two natural ways of scanning or searching the edges of a graph 
as we move from vertex to vertex: (i) once at a vertex v we scan all edges 
incident on v and then move to an adjacent vertex w. At w we scan all edges 
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incident on w. This process is continued till all edges in the graph are scanned. 
This method of fanning out at each vertex is referred to as the breadth-first 
search of the graph. This was the method used in Algorithm 3. It was also 
employed implicitly in Algorithms 1 and 6. (ii) An opposite approach is 
instead of scanning every edge incident on vertex v, we move to an adja¬ 
cent vertex w (a vertex not visited before) as soon as possible, leaving v with 
possibly unexplored edges for the time being. In other words, we trace a walk 
through the graph going on to a new vertex whenever possible. This method of 
traversing the graph, called the depth-first search (DFS), has been found to be 
very useful in simplifying many graph-theoretic algorithms, because of the 
resulting numbering of the vertices and orientations imposed on the edges. 

Numbering Vertices and Orienting Edges in DFS: During a DFS on a 
graph, whenever a vertex v is visited for the first time we assign it a distinct 
serial number NUM(u), so that NUM(V) = / if v was the ith vertex to be 
visited during the traversal. Also an orientation is imposed on each edge 
along the route of the traversal. When the search terminates the undirected 
graph G on which the DFS was being performed, becomes a digraph G with 
its vertices numbered 1,2,...,«. The details of the DFS algorithm can be 
best described by the following steps: 

Description of the DFS Algorithm: Let G be the given undirected graph, 
inputted as neighbor listings (i.e., representation (e) in Section 11-2). Let x 
be the specified vertex from which the search is to begin. PALM and FRON 
are two disjoint subsets into which the edges of G are to be partitioned. 

Step 1: Set v <— x, i * — 0, PALM <— 0, FRON <— 0 

Step 2: Set i *— i + 1, NUM(v) <— / 

Step 3: Look for an untraversed edge incident on v. 

(a) If there is no such edge (i.e., every edge incident on v has 
already been traversed), go to Step 5; otherwise, 

(b) Pick the first untraversed edge at v, say ( v , w), and traverse 
this edge. Orient the edge (v, w) from v to w. Now you are at 
vertex w. 

Step 4: (a) If w is a vertex which has not been visited before during this 
search (that is, if NUM(w) is undefined), add edge (v, w) to 
the set PALM. Set v *— w and go to Step 2. 

(b) If w is a vertex which has been visited earlier (that is, NUM 
(w) < NUM(v)), add edge (v, w) to the set FRON. Go to 
Step 3. You are back at vertex v. 

Step 5: Check to see if there exists some traversed edge (w, v) in set PALM 
oriented toward v. 

(a) If there is such an edge move back to vertex u. (Note that u 
is the vertex from which v was visited for the first time.) Set 
v — u and go to Step 2. 
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(b) If there is no such edge (u, v), stop (we are back at root x, 
having traversed every edge and visited every vertex con¬ 
nected to x). 

The DFS algorithm just described is illustrated by an example in Fig. 
11-12. In the given graph G of five vertices and eight edges, the starting vertex 
jc is specified. The order in which the edges are explored is given in Fig. 11-12 
(b), and for this order of traversal G is given in Fig. 11-12(c). 


c b 



Graph G before DFS 
(a) 


(x,a): Branch 
(a,b): Branch 
(b,x): Frond 
(b, c): Branch 
(c, x): Frond 
(<\ a): Frond 
(b,d): Branch 
(d, *): Frond 


Order in which edges 
were scanned. 

(b) 


4 5 



Graph G after DFS 
(c) 


Fig. 11-12 Depth-first search on a graph. 


Palm Tree and Fronds: 


It is not difficult to see that if this DFS procedure is applied to any connected, 
undirected graph G (with n vertices and e edges), it will terminate after num¬ 
bering the vertices 1, 2,. . . , n and orienting every edge in G. Let G be the 
resulting digraph. Consider PALM, the set of n — 1 oriented edges, each of 
which led to a new vertex during the DFS. This subdigraph (defined by the 
edge set PALM) is a spanning arborescence in G, because every vertex in this 
subdigraph, except the root x, has an in-degree equal to one, and the in¬ 
degree of * is zero. (Review Sec. 9-6, to recall arborescence and its properties.) 
This spanning arborescence is referred to as a palm tree. Edges not in the 
palm tree (i.e., edges belonging to the set FRON) are called fronds. Since for 
every frond (a, b ) vertex b was visited before a , NUM(Zj) < NUM(a). In 
other words, every frond is oriented from a higher-numbered vertex to a 
lower-numbered vertex. 

The DFS by itself does not reveal properties of a given graph G (except 
whether or not G is connected). What it does, however, is to number the 
vertices in a systematic manner and partition the edges into two sets PALM 
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and FRON, with properties just discussed. It is this which makes DFS a 
powerful tool in the construction of efficient algorithms for solving a sur¬ 
prisingly large number of graph-theory problems. The following are some of 
the problems for which algorithms have been constructed employing DFS. 

(1) Identification of components. (2) Identification of blocks and cut- 
vertices, [11-61]. (3) Identification of maximal subgraphs of connectivity three 
or more [11-32]. (4) Planarity [11-31], and [11-62]. (5) Isomorphism of planar 
graphs [11-32], (6) Identification of fragments (i.e., maximal strongly connect¬ 
ed subgraphs) in a digraph [11-61]. 

It has been shown (see the appropriate reference) that the computation 
time of all these six algorithms is proportional to e , the number of edges, if 
the graph is given in the neighbor-listing form of Sec. 11-2(e). And since 
every edge must be examined at least once, this is also the lower bound on 
these algorithms, disregarding multiplicative constants, of course. 

We shall now sketch the planarity algorithm, a problem to which DFS has 
been applied with spectacular success, resulting in drastic improvements in 
computation time over earlier methods. 

Algorithm 8: Planarity Testing 

The problem of determining whether or not a given graph is planar is an 
important one. As pointed out in Chapter 5, the planarity characterizations 
of Kuratowski, Whitney, or MacLane (although theoretically elegant) are 
unsuitable for testing by a computer. They are difficult to implement; and, 
besides, if a graph is planar, these methods do not yield a plane representa¬ 
tion, which is often what is needed. It has been shown, for example, that if 
Kuratowski’s characterization is used to test planarity of an ^-vertex graph 
{n > 5), the computation time is at least proportional to n 6 (see [5-8]). 

In recent years many algorithms for planarity testing have been proposed 
and programmed on computers (see [5-8] for a survey). Most of these methods 
employ the map-construction approach, which works as follows: A planar 
subgraph g (in most algorithms g is a circuit) of the given graph G is first 
selected and mapped on a plane. Then gradually the remaining edges are 
added to g, such that no crossings occur. If we succeed in the reconstruction, 
graph G is obviously planar, and we have obtained a plane representation of 
G. If we fail, G is nonplanar. 

The only difficult part of such an algorithm is that in the early stages of 
adding edges to g we have choices available (i.e., ambiguity) in placement of 
edges. A wrong choice made earlier may later prevent us from adding an 
edge, even if the graph is planar. For example, in Fig. 11-13(a), suppose that 
the starting subgraph g was the circuit [e { , e 2 , e 3 , e 4 , e 5 }. Then we add edges 
e 6 , e n , e 8 , and e 9 , without any crossover. Now we find that the last edge e 10 
cannot be added without a crossing. From this we might erroneously con- 
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(a) 


Fig. 11-13 



elude that G is nonplanar. On the other hand, had we selected a different face 
for placing vertex v, we would have obtained a planar representation, as 
shown in Fig. 11-13(b). 

This essentially is the problem in the map-construction method of plan¬ 
arity testing, and different procedures have been devised to solve it. 

Preliminary Simplification: As pointed out in Section 5-5, an arbitrary 
graph can, in general, be reduced to a much smaller graph if subjected to cer¬ 
tain simplifying steps. These steps do not affect the planarity (or the non¬ 
planarity) of a graph. 

1. Apply Algorithm 1 to check for connectedness. If the graph is discon¬ 
nected, consider only one component at a time. 

2. Remove all self-loops, and replace each set of parallel edges by a single 
edge. 

3. Eliminate every vertex of degree two by merging the two edges incident 
on the vertex. Apply steps 2 and 3 alternately and repeatedly, till the 
graph cannot be reduced any further. 

4. Apply Algorithm 4 to partition the graph into its blocks (i.e., maximal 
nonseparable subgraphs). 

5. Subject each block to reduction steps 3 and 2 alternately till no further 
reduction is possible. 

6. Each simplified block thus obtained, with e edges and n vertices, is 
tested for 
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If any of these three inequalities is not satisfied, our job is finished, 
and we move on to the next block. Every graph with n < 5 or e < 9 
is planar, and every simple graph with e > 3n — 6 is nonplanar. 

One planarity testing algorithm due to Bruno, Steiglitz, and Weinberg 
[5-2] goes a step further in simplification. Each nonseparable graph is further 
broken down into its maximum 3-connected subgraphs (called 3-connected 
components). Then the following result, due to W. T. Tutte, is used: A graph 
is planar if and only if all of its 3-connected components are planar. This 
algorithm, however, is not as efficient as the one due to Hopcroft and Tarjan 
[11-31], which will be described next. 

Description of the Algorithm: The planarity-testing algorithm is quite 
involved. We shall sketch only its essential features. To understand the main 
algorithm, let us consider the following decomposition procedure applied to 
a given simple, nonseparable graph G with n vertices and e edges: 

Circuit-Path Decomposition: 

Step 1: Find some circuit K in G. Set g *— K. Label the vertices and edges 
of g as v x v 2 , . . ., and e n e 2 ,, respectively. Set i *— 1. 

Step 2: If there is an unlabeled edge in G, find a path p l that begins and 
ends at labeled vertices but consists only of unlabeled edges. Store p r If there 
is no unlabeled edge left in G, go to step 4. 

Step 3: Set g g u Set i *— / + 1. Label the unlabeled edges and 
vertices in g. Return to step 2. 

Step 4: Stop. Print g, /?,, p 2 ,. . ., p m . 

It can be shown ([5-8]) that the procedure just outlined decomposes the 
simple, nonseparable graph G into one circuit and m = e — n paths. Since 
the circuit may be looked upon as two edge-disjoint paths, G is thus decom¬ 
posed into e — n + 2 paths. It may be noted that although such a decompo¬ 
sition of a graph G may not be unique, the number of paths into which G is 
decomposed is constant and equals one circuit and e — n paths. For example, 
in Fig. 11-13 consider two distinct decompositions, each with one circuit and 
four (10 — 6 = 4) paths: 

{^1? ^2’ ^3> ITio}’ (^7 > ^ 9 }’ }» 

and {e n e 2 ',e 3 ,e 4 ,e 5 }, {e 7 , e 8 ], [e 9 ], {e 6 }, {e, 0 }. 

In this circuit-path decomposition, we can map the circuit K on a plane, 
and continue to add new paths p x , p 2 , . . . , as they are generated. A new path 
p. will either divide an existing face into two new faces, or will make g U p t 
nonplanar, when added to the planar map g. This method of arbitrarily add¬ 
ing paths as they are generated may lead to a situation shown in Fig. 11-13, 
which has already been discussed. To solve this problem, one can either 



SEC. 11-6 


DEPTH-FIRST SEARCH ON A GRAPH 


307 


1. Continue adding paths till no path can be added. Then backtrack to 
explore the alternative choices he could have made earlier, or 

2. Continue to look at different paths but not add them to K, till it is 
found which face a path must be placed in, or it is ascertained that it 
does not matter which face the path is placed in. 

Some algorithms (see [11-53]) use approach 1, but Hopcroft and Tarjan 
[11-31] have used approach 2 and have shown that their algorithm is more 
efficient because of it. They use list processing and have an elaborate program 
(985 lines of ALGOL). The gist of their technique of resolving ambiguity in 
adding paths, is as follows: 

Suppose that at any stage we have a path (on top of a pushdown list of 
paths) whose ambiguity we are trying to resolve. Let a and b be the end 
vertices of p t . The flow chart in Fig. 11-14 shows the different cases that may 
arise and what action is taken for each. These steps are explained by means 
of Fig. 11-15. 

In Fig. 11-15(a) path p { can be swiveled at vertices a and b , and therefore 
can divide either the face “above” or “below” the path p } . This ambiguity of 
path Pi must be resolved. To resolve this ambiguity, starting from some ver¬ 
tex x on path p t a new path p k is constructed. (The path p k consists of un¬ 
labeled edges and it terminates as soon as it touches a labeled vertex.) 

If both end vertices x and y of p k are on path as shown in Fig. 11-15(b), 
p k can be swiveled at vertices a: and y and thus divides either of the two faces— 
one “above” p ( and the other “below.” Thus not only did we not resolve the 
ambiguity in placement of path p p but we have a new path p k whose ambigu¬ 
ity must be resolved first. Path p k is put above p. in the stack, and we begin 
resolving its ambiguity just as we were doing for p P 

Another possibility is that the end vertex y is neither on path p t nor on 
Pj but on a different path, as shown in Fig. 11-15(c). In this case p t cannot be 
swiveled, and therefore there is no ambiguity as to which face p t divides. 

As in Fig. 11-15(d), if path p t ends on a vertex in path p 3 between a and b, 
then Pi (together with p k ) can still be swiveled about a and b. Therefore, the 
ambiguity in p. remains. The path p k , however, divides the face a xb y a 
unambiguously. Therefore, we shall have to generate another path from a 
vertex on p t for resolving ambiguity in the placement of p r 

Finally in the case where path p k ends on a vertex on p 3 but not between 
a and b , we have the situation shown in Fig. 11-15(e). The path p t can still be 
swiveled. But, unlike Fig. 11-15(b), there is no ambiguity in path p k with 
respect to p r Therefore, a new path must be generated to resolve ambiguity 
in p p and that path can be generated starting from any vertex in path y a xb, 
which is path extended up to vertex y on pj. 

In summary, the planarity algorithm consists of routines (a) for finding 
blocks (i.e., maximal nonseparable subgraphs), (b) for partitioning each block 




Fig. 11-14 


Resolution of ambiguity of path p t . 


Store p r and 
resolve ambiguity 
of the next path 
in the stack 


308 









SEC. 11-6 


DEPTH-FIRST SEARCH ON A GRAPH 


309 



Fig. 11-15 Resolution of ambiguity of path p ( . 


into paths, (c) for deciding where to add these paths, and (d) for building a 
planar representation. The DFS is used in both (a) and (b). The paths generat¬ 
ed from the resulting palm tree and fronds have several important properties 
which are made use of in (c) and (d). The time required for each, except (d), 
is proportional to the number of edges, e. The building of planar representa¬ 
tion requires time proportional to n log n, n being the number of vertices. 
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Thus the theoretical time bound for the entire algorithm is proportional to 
n log n. (Because of Euler’s equation e is proportional to n, in graphs 
subjected to planarity algorithm.) 

This algorithm has subsequently been improved and was reported in 
Tarjan’s Ph.D. Thesis [11-61]. The time taken by the improved version is 
proportional to just n. A somewhat simplified form of the improved version 
appeared as a Cornell University technical report [11-61]. 

11-7. ALGORITHM 9: ISOMORPHISM 

The graph isomorphism problem is to determine if there exists a one-to 
one correspondence between the vertices of two graphs G, and G 2 that pre¬ 
serves the adjacency of vertices. The problem of graph isomorphism arises in 
many fields, such as chemistry, switching theory, information retrieval, and 
linguistics. Consequently, the isomorphism problem has been studied exten¬ 
sively. For a survey and references, see Corneil’s Ph.D. thesis [11-9], and 
[ 11 - 12 ]. 

Theoretically, it is always possible to determine whether or not two graphs 
G, and G 2 are isomorphic by keeping G, fixed and reordering vertices of G 2 
to check if their adjacency matrices become identical. This process may re¬ 
quire all n \ reordering and comparisons, n being the number of vertices. Such 
an inefficient procedure, in which the running time grows factorially with n, 
is of limited use for practical problems. An algorithm guaranteeing a solution 
in running time proportional to a constant power of n is desirable, but no such 
algorithm has been discovered for determining if two arbitrary graphs are 
isomorphic.t 

There are, however, efficient isomorphism algorithms available for cer¬ 
tain types of graphs. Some of these are 

1. Isomorphism in trees: [11-9], pages V-44-V-52. 

2. Isomorphism in planar graphs: [11-32]. 

3. Isomorphism in graphs containing no ^-strongly regular subgraphs 
for large k: see [11-9] for definition of a ^-strongly regular graph. 

4. Isomorphism in partially labeled graphs with special structure: [11-59] 
and [11-64]. (See also Chapter 15 in this book.) 

Heuristic Procedure 

If two given graphs G, and G 2 are arbitrary, the usual approach is first 
to try to show if G, and G 2 are not isomorphic. This is done by asking ques- 

tThere are problems for which only factorially or exponentially growing algorithms 
exist, and better ones may never be found. In such cases, one does have to live with an 
inefficient algorithm and use it for small graphs. 
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tions of the following type: 

1. Do G, and G 2 have the same number of vertices? 

2. Do G, and G 2 have the same number of edges? 

3. Is the number of vertices n t with degree i the same in both, for / = 1, 
2 9 

4. Do both graphs have the same number of components? 

5. For each component are questions 1, 2, and 3 answered in the affir¬ 
mative? 

6. Are the characteristic polynomialst of their adjacency matrices X(G,) 
and X(G 2 ) the same (taken in the field of real numbers)? 

And so on. 

Clearly, if the answer to any of these questions is no, G, and G 2 are non¬ 
isomorphic. A yes answer, however, does not guarantee an isomorphism. 

An invariant of a graph G is a number that is the same for all graphs 
isomorphic to G. Some examples of invariants are number of vertices «, 
number of edges e, rank R, nullity / 1 , number of components p, connectivity, 
and coefficients in the characteristic polynomial of the adjacency matrix. A 
complete set of invariants is a set of invariants that completely describes a 
graph within isomorphism. 

The problem of graph isomorphism is solved if we can find a complete 
set of invariants for G x and G 2 and then check to see if they are identical. 

The problem of finding a complete set of invariants can also be thought 
of as the coding of a graph. If we could find a complete set of invariants, we 
would place them in a sequence. This sequence would contain all the essential 
information about a graph. Two graphs would be isomorphic if and only if 
their codes were the same. The problem of coding trees and some other types 
of graphs has been solved, [11-54], [11-66]. But there exists no method of 
coding an arbitrary graph with a large number of vertices. 

Numerous heuristic procedures have been proposed (and programmed) 
based on the idea that if you compute many invariants, and if they are the 
same for both G, and G 2 , it is likely that G, and G 2 are isomorphic. Heuristic 
approaches work well for graphs of small orders. For example, it can be 
shown that two simple, connected graphs with n 7 which have affirmative 
answers for questions 3 and 6 are isomorphic. But these heuristic algorithms 
fail for large arbitrary graphs. 

Sometimes heuristic procedures are also used for simplifying the last- 
resort method involving vertex-by-vertex correspondence between G t and 

tit is not difficult to show that the characteristic polynomial, det(X — Al), is indepen¬ 
dent of the order in which the vertices appear in the adjacency matrix X. 
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G 2 . For example, the number of comparisons can always be reduced from n\ 
to 

i= 1 
k 

where X n i = n - 

i- 1 

This is because in reordering the vertices in G 2 it suffices to permute only 
vertices with the same degree. For a graph that does not have a large percent¬ 
age of its vertices with the same degree, the number X n r ' s much smaller 
than n\. 

11-8. OTHER GRAPH-THEORETIC ALGORITHMS 

The algorithms described so far, although very important and basic, are 
only a few samples out of scores of graph-theoretic algorithms available in 
the literature. Obviously, it is not possible to include them all. Some of the 
other commonly used algorithms for graphs are 

1. Find all fragments (i.e., maximal strongly connected subgraphs) in a 
given digraph G [11-61]. 

2. Find a Hamiltonian path (if there is one) in a given undirected graph. 
See [11-49]. 

3. Find all directed Hamiltonian circuits in a given digraph. See [11-13], 
pages 35-37. 

4. Find a maximal complete subgraph (clique) in a given graph G. See 
[11-1] and [11-44]. 

5. Find a maximal matching in a bipartite graph. An algorithm, known 

as the Hungarian method, is often used for solving this assignment 
problem. The Hungarian method is available as a standard subroutine 
in most operations research computer program packages. See [9-4], 
pages 265-269. The computation time for the Hungarian method, for 
a graph of n vertices and e edges, is proportional to e-n. Recently a 
more efficient algorithm has been proposed by Hopcroft and Karp 
[11-30]; it takes time proportional to (e + . 

6. Given a connected weighted digraph G in which the weight of an 
edge represents the maximum rate of flow through that edge, find 
the maximum possible flow from a vertex a: to another vertex y in G. 
This is the well-known maximum-flow problem and is solved by the 
Ford-Fulkerson algorithm (discussed in Sections 4-6 and 14-1). The 
Ford-Fulkerson algorithm is also a standard program in operations 
research. See [9-4], For a more recent and improved algorithm see 
Edmonds and Karp [11-19]. 
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7. Find the chromatic number of a given graph. See [11-26]. 

8. Given an acyclic digraph, perform a topological sorting of its vertices. 
See Section 14-8 for definition, an algorithm, and references. 

9. Given a connected weighted graph (7, partition the vertices of G into 
subsets no larger than a given size so as to minimize the total weight 
of edges cut in the process. For a heuristic algorithm to solve this 
important but difficult problem, see [11-36]. Also, see [11-35]. 

10. Given a connected weighted graph G, find a Hamiltonian circuit with 
smallest weight. This is the traveling-salesman problem, for which no 
satisfactory solution has been found so far. For heuristic algorithms 
see [2-1]. 

11. In a given graph G = (V, E), find a smallest subset (i.e., subset of 
minimum cardinality) of edges E' <= E such that every vertex of G is 
incident on at least one of the edges in £'. This problem of finding a 
smallest covering was discussed in Section 8-5. See [11-18] for an 
efficient algorithm. 

12. In a given graph G = (V, E), find a smallest subset of vertices V' 
such that every vertex of G not included in V is adjacent to at least 
one vertex in V. This problem of finding a smallest dominating set 
was discussed in Section 8-2. Superficially similar to 11, this problem 
is much more difficult. 

13. In a given digraph G, find a smallest set of edges which when deleted 
from G would destroy all directed circuits. A method of finding this 
minimum decyclization edge set (or feedback are set) was outlined 
in Section 9-11, but the computation time grew exponentially with n, 
the number of vertices. 

14. In the previous problem instead of finding a smallest set of edges we 
now wish to find a smallest set of vertices in G whose removal destroys 
all directed circuits in G. Just as for 13 no efficient algorithm is known 
for this problem. 

15. Given a connected weighted graph G = (V, E), and a subset V' of 
V. Find a minimal tree in G which spans the vertices in V (and 
possibly some more). Such a tree, known as a Steiner tree, is much 
more difficult to obtain than a minimal spanning tree for G. The 
difficulty arises from the fact that inclusion of extra vertices may lead 
to a different tree with a smaller total weight. Although no efficient 
algorithm has been found for an exact solution of this problem, an 
approximate solution can be obtained by an efficient algorithm given 
by S. K. Chang (in “The Generation of Minimal Trees with a Steiner 
Topology,” J. ACM, Vol. 19, No. 4, October 1972, 699-711). 
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11-9. PERFORMANCE OF GRAPH-THEORETIC ALGORITHMS 

As observed in Section 11-7, for a given graph-theory problem it would 
be desirable to have an algorithm which guarantees a solution in an execution 
time proportional to some constant power of n or e (as usual, n and e are the 
number of vertices and edges, respectively, in the given graph). In other 
words, the execution time t (for the worst possible graph) can be expressed as 

t 5S a/i* or t ^ fie 9 , 

and the lower the value of k and q the better. Such an algorithm (whose com¬ 
putation time is bounded by a polynomial in n or e) is called a polynomial 
bounded algorithm. For example, Algorithm 1 (connectedness and compo¬ 
nents) is polynomial-bounded, since k = 2. Algorithms 2, 3, 4, 6, 7, and 8 
are also polynomial-bounded, but Algorithms 5 and 9 are not. Some impor¬ 
tant polynomial-bounded graph-theoretic algorithms along with their bounds 


Problems 

Run-Time Bounds 

Relevant References 

Connectedness and 
components 

n 2 or e 

[11-29], [11-61] 

Spanning tree 

e 

[11-58] 

Minimal spanning tree 

n 2 

[11-22], [11-70] 

Fundamental circuit-set 

n v , 

2 ^ v ^ 3 

[11-47] 

Cut-vertices and blocks 

n 2 or e 

[11-48], [11-61] 

Bridges 

n 2 or e 

[11-10], [11-61] 

Shortest path between two 
vertices 

n 2 

[11-16], [11-73] 

Shortest paths between 
all vertex-pairs 

n 3 

[11-21], [11-73] 

Transitive closure 

n* 

2 < a < 3 

[11-45] 

Strong connectedness 
and fragments 

n 2 or e 

[11-29], [11-61] 

Planarity 

e 

[11-62] 

Topological sorting 

e 

[11-39] 

Maximal matching in 
a bipartite graph 

« 5/2 

[11-30] 

Minimal cut 

nP 

2 < 0 < 3 

[11-19] 

Minimal edge cover 

n 3 

[11-18], [11-46] 


Table 11-1 Polynomial-Bounded Algorithms 
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and relevant references are shown in Table 11-1. Note that since for a simple 
graph e 5S n{n — l)/2 g n 2 /2 , bounds in terms of e and n are convertible 
into each other. It should also be kept in mind that different algorithms 
bounded by the same power of n may take very different amounts of actual 
computer time (for the same graph) because of their different multiplicative 
constants. 

On the other hand, there are graph-theoretic problems for which it is 
simply not possible to have a polynomial-bounded algorithm. Take, for exam¬ 
ple, the problem of generating all spanning trees of a given graph, as dis¬ 
cussed in Section 11-4. The number of spanning trees in an w-vertex simple, 
labeled graph can be as high as n"~ 2 . Therefore, if each spanning tree were 
generated in c units of time, the algorithm to generate all spanning trees would 
consume c-n"~ 2 units of time. Thus no polynomial-bounded algorithm can 
be found for this problem. Similar arguments hold for problems of generating 
all cliques, all circuits/directed circuits, all paths, all cut-sets, and so forth, 
for a given graph. 

There is a third category of graph-theoretic problems, for which so far no 
polynomial-bounded algorithms have been discovered, nor has it been pos¬ 
sible to show that polynomial-bounded algorithms do not exist for these 
problems. Detection of isomorphism (Algorithm 9) is one such problem. A 
list of important problems of this type is given in Table 11-2. The computa¬ 
tion time for solving these problems (using the best available algorithm at 
present, and the worst possible graph) grows exponentially or factorially (but 
not polynomially) with n. Such inefficient algorithms are obviously of very 
limited use for practical problems. Heuristic techniques are the mainstay of 
their solutions. 

Based on a remarkable result of Stephen Cook (1971), Richard Karp 
[11-34] showed the following surprising result: Except for the isomorphism 
problem, all other problems in Table 11-2 are polynomially equivalent, that 
is, if a polynomial-bounded algorithm exists for one, polynomial-bounded 


Problems 

Relevant References 

Chromatic number 

[11-26], [11-41] 

Smallest dominating set 

[11-44] 

Maximal clique 

[11-20] 

Hamiltonian circuit 

[11-56] 

Directed hamiltonian circuit 

[11-56] 

Traveling salesman problem 

[11-28] 

Minimal feedback edge-set 

[9-6] 

Minimal feedback vertex set 

[9-6] 

Steiner tree 

Chang in Section 11-8 

Isomorphism 

[11-9], [11-12], [11-64] 


Table 11-2 Nonpolynomial Algorithms 
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algorithms can be found for the others. The proof of this equivalence is 
involved and is not very relevant for us here. (The Cook-Karp class of prob¬ 
lems includes a number of other combinatorial and graph-theoretic prob¬ 
lems in addition to the top nine in Table 11-2.) 


11-10. GRAPH-THEORETIC COMPUTER LANGUAGES 

The increasing interest in graph-theoretic computations has led to the 
development of several programming languages for the sole purpose of han¬ 
dling graphs. The major goal of such a language is to enable the user to for¬ 
mulate operations on graphs in a compact and natural manner, as if he were 
communicating with another graph theorist. For example, in one such 
language [11-37], the statement SPANTREE @ G would call the subroutine 
for a spanning tree and would find a spanning tree of the graph G. Another 
statement IF (G, PLANAR) 17, 3 would transfer control to statement 17 if 
G is planar, otherwise to statement 3. 

Once such a language is developed and implemented, its advantages are 
enormous. It makes the writing of graph-theory programs easy and compact. 
It frees the user from having to concern himself with many unnecessary 
details and allows him to concentrate on the essential features of his program. 
The disadvantage of such a language, as of all special-purpose languages, is 
that it takes a great deal of time, trouble, and expense to develop such a 
language, which can be used only for the purpose of writing programs in 
graph theory. 

For such a graph-theoretic computer language to be useful to many users, 
with different problems in graph theory, the language must have a large num¬ 
ber of primitives (i.e., basic graph-theoretic statements) such as “remove 
vertex k of G ”, “add an edge between vertices x and y", or “find the shortest 
distance from vertex u to vertex v in G ". Moreover, the graph-theoretic 
language must have all the computing facilities of an existing symbolic lan¬ 
guage such as FORTRAN, so that the programmer can perform functions 
which are not covered by primitives. Since there is little to be gained by devel¬ 
oping an entirely new language from scratch, all graph-theoretic languages 
available and being developed are extensions of some well-known program¬ 
ming language. 

Some graph-theoretic computer languages available at present are 

1. Graph-Theoretic Language (GTPL) at the University of West Indies, 
Jamaica [11-52], It is an extension of FORTRAN. 

2. Graph Algorithm Software Package (GASP) at the University of 
Illinois [11-7]. It is an extension of PL/1. 
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3. HINT at Michigan State University [11-27] is an extension of the list¬ 
processing language LISP 1.5. 

4. GRASPE at the University of Houston, [11-23] and [11-51], is also an 
extension of LISP 1.5. 

5. Directed Graph Processor (DIP) at Carnegie-Mellon University is an 
extension of PL/1. (See a report by Terry C. Gleason, 1969.) 

6. An Interactive Graph Theory System at the University of Pennsylvania 
is an extension of FORTRAN [11-71]. 

7. Graphic Extended ALGOL (GEA) at Instituto di Elettrotecnica ed 
Elettronica del Politecnico di Milano, Italy is an extension of ALGOL: 
[11-13] and [11-14]. 

8. AMBIT/G, developed by C. Christensen for manipulation of digraphs, 
is an extension of AMBIT [11-57]. 

9. GIRL—Graph Information Retrieval Language [11-4]. 

10. FORTRAN Extended Graph Algorithmic Language (FGRAAL) at 
the University of Maryland [11-2], 

The interested reader should consult the cited references for the details of 
these languages. 

SUMMARY 

Computational aspects of graph theory were presented in this chapter. 
For anyone interested in applying graph theory to solve physical problems 
(such as flow problems, assignment problems, identification of a chemical 
compound, topological analysis of an electrical network, layout of a printed- 
circuit board, etc.), it is essential to be able to enlist the help of the digital 
computer. Without the help of high-speed electronic computers, he cannot 
hope to handle a graph of a size generally encountered in solving a nontrivial 
practical problem. 

A computer that has been “taught” elementary graph theory (such as, 
find out if graph G is separable, or pick out a spanning tree in G) can be of 
immense aid even to a “pure” graph theorist. It can, for instance, relieve him 
of the drudgery of finding graphs with special properties to serve as examples 
and counterexamples. 

To be able to teach graph theory to a computer, one must obviously know 
both computer programming and graph theory. In addition, one must find 
efficient algorithms. The most important prerequisite of any useful graph- 
theoretic algorithm is that the running time of its program on the computer 
must not rise factorially or even exponentially with n. It should be propor¬ 
tional to n k , where k is some fixed number—preferably a low number. 
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PROBLEMS 

11-1. For studying the behavior of an algorithm, random graphs are often used. A 
graph is random if its edges are drawn at random from the set of all distinct pairs 
of vertices. Write a subroutine for generating simple random graphs of a given n 
and density, where density is defined as the ratio e/n{n — 1) for a directed graph 
and 2 ejn{n — 1) for an undirected graph. {Hint: Use an appropriate pseudo¬ 
random-number generator, and obtain the adjacency matrix.) 

11-2. Write subroutines for converting the following graph representations: 

(a) Adjacency matrix to incidence matrix. 

(b) Incidence matrix to adjacency matrix. 

(c) Adjacency matrix to edge listing. 

(d) Edge listing to successor listing. 

(e) Successor listing to adjacency matrix. 
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11-3. Write a program in the assembly language of the machine you may have to pack 
a given adjacency matrix X(n, n). Assume that the given matrix uses full words for 
each entry whether it is 0 or 1. 

11-4. Write a FUNCTION subprogram ADJ, which when supplied with the subscripts 
O',/) gives the yth entry of the (packed) adjacency matrix as its value. Assume 
that the packed adjacency matrix is in a COMMON area. 

11-5. The algorithm for connectedness given in the text modifies the adjacency matrix. 
How will you restore it? (Hint: Observe that only one row of the adjacency matrix 
is getting changed at a time.) 

11-6. Analyze to determine when it would be profitable to determine the vertex of 
maximum degree in each component before fusion in Algorithm 1. Assume the 
graph is given in the form of an adjacency matrix. 

11-7. Will Algorithm 1 require any modification to find the components of a digraph? 
Also write a program for identifying all fragments of a digraph. 

11-8. In Algorithm 3, after an edge (z, p) has been considered it is deleted from the 
graph; that is, in adjacency matrix X entries x 2 , p and x p , 2 are made zero. What 
could be done if one wanted to avoid modifying X? 

11-9. While considering the edge (z, p) if we found that p was already in the tree, we 
went ahead to discover the fundamental circuit. Let L = LEVEL(z) — LEVEL(p). 
Prove that the fundamental circuit will contain all and only edges on the path of 
length L from z to the root of the tree, apart from the two edges (z, p) and 
(p, PRED(/0). 

11-10. Use the following convention in drawing the tree developed by Algorithm 3. 
Draw the tree downward from the root, and add vertex p at depth LEVEL(/>) 
below the root. When vertex z is being examined, the edges (z, p) are added from 
left to right. If z is the vertex under examination, define the trunk as the path 
in the tree T from z to the root. Then show that the vertices in T fall into four 
classes: 

1. Vertex z that is being examined. 

2. Vertices below z—unexamined and added during the examination of z. 

3. Vertices to the “left of” the trunk—unexamined and at a distance of one from 
the trunk. 

4. Vertices on and “to the right of” the trunk—examined. 

11-11. In Algorithm 4 let /j be a fundamental circuit found while examining vertex z u 
and f 2 be a fundamental circuit found later while examining z 2 . If /i n / 2 ^ 0, 
show that the tree edge (z,, PREDCz,)) is in both /, and / 2 . Is this why it was 
not necessary to label the chords? 

11-12. In Algorithm 5 suggest a quick way of testing whether a vertex v, which is being 
considered for extension, is already in the path array P. [Getting every p(i) for 
i = l,..., k, where k is the length of the path built so far, and comparing with 
v is obviously bad.] Give a flow chart for the algorithm incorporating this. 

11-13. In Algorithm 6 suppose you suspect that the graph is disconnected and the starting 
and the terminal vertex may not be in the same component. The flow chart given 
in Fig. 11-9 works for this case also, but it is inefficient. Suggest a method of 
speeding up the detection in this case. (Hint: You will have to add a test box at 
an appropriate place in the flow chart, on the outcome of which you would decide 
whether or not to continue.) 

11-14. Modify Algorithm 6 so that it lists all shortest paths from 5 to t. 



CHAP. 11 


APPENDIX OF PROGRAMS 


323 


11-15. Given the optimal-policy matrix Z as mentioned in Algorithm 7, write a subroutine 
that returns the shortest path (/,... J) for a specified / and j, as a sequence of 
vertex numbers. 

11-16. Write a program similar to Algorithm 6 to obtain the longest distance from a 
vertex s to all vertices accessible from 5 in a given acyclic digraph. 

11-17. Algorithm 7 can also be used for detecting whether or not a given graph is con¬ 
nected. (An oo in matrix D fc represents nonexistence of a path.) As presented, 
the algorithm is inefficient for this purpose. Rewrite Algorithm 7 solely for the 
purpose of identifying various components in a graph. {Hint: Use X rather than 
D and logical operations rather than arithmetic ones.) Compare its efficiency with 
that of Algorithm 1. 

11-18. Give an algorithm to find a cut-set with respect to a given pair of vertices a, b. 
Assume that the graph is given in terms of its F-H representation. You are allowed 
to scan the F-H arrays only twice. [Hint: If there is no edge {a, b), add one. Choose 
the edge {a, b). Shrink an edge e in G not parallel to {a, b) by fusing its end vertices. 
Some edges might now become parallel; continue shrinking.] 

11-19. Given an edge e*, write an efficient algorithm to determine if e* is a bridge. {Hint: 
Edge e* is a bridge if and only if {e*} is a cut-set. Use the result of Problem 11-18.) 

11-20. Write a program for generating all spanning trees, based on Minty’s method 
described in Section 11-4. 


APPENDIX OF PROGRAMS 

Program listings of some of the algorithms described are given in the 
following pages. Program 11-1 is written in APL\360; the rest are in 
FORTRAN IV in the form of subroutines. The variables in the subroutines 
are dimensioned such that a graph with no more than 100 vertices can be 
given as input. 

Program 11-1: Connectedness and Components 



[1] SL*-R*-/N 

[2] C<—1 

[3] LOOP1 : l*-Dif/D*—+ /X 

[4] VA—X[I;] 

[5] VA[l]t—1 

[6] LOOP2 : VN<-VA 

[7] VA*-VAv (v/X [ ;VA/ S L]) 

[8] —»(0#+/VN a VA)/LOOP2 

[9] VN<—~VA 

[10] 'COMP ';C;'; VERT: ';VA/R 

[11] —>(0-+/VN)/HALT 

[12] R*—VN/R 

[13] NA*-VN/SL 

[14] X*-X[NA;NA] 

[15] SL*-i/?R 

[16] C*-C + 1 

[17] -»LOOP1 

[18] HALT: 'HALT' 
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X Adjacency matrix of g 

N Number of vertices in G 

C Label of a component 

I Vertex with maximum degree in g 

VA Ith row of X 

VN Logical complement of VA 

R List of vertices in g 

NA Vertices not adjacent to 1 

SL Relabeled list of vertices in g 


Program 11-2: Spanning Tree/Forest 

SUBROUTINE SPTREE ( F , H , N , E , EDGE , C ) 

INTEGER C , E , EDGE(E) , F(E) , H(E) , VERTEX(IOO) VI , V2 
DO 4 L = 1 , N 
4 VERTEX(L) = 0 
DO 6 L = 1 , E 
6 EDGE(L) = 0 
C = 0 
M = 0 
K = 0 

10 K = K + 1 
VI = F(K) 

I = VERTEX(VI) 

IF ( I .EQ. 0 ) GO TO 39 
V2 = H(K) 

J = VERTEX(V2) 

IF ( J .EQ. 0 ) GO TO 36 
IF ( I - J) 21, 50, 18 

18 IJI = J 
J = I 
I = IJI 

21 DO 26 L = 1 , N 

IF ( VERTEX(L) - J ) 26, 23, 25 

23 VERTEX(L) = I 
GO TO 26 

25 VERTEX(L) = VERTEX(L) - 1 

26 CONTINUE 

DO 32 L = 1 , E 

IF ( EDGE(L) - J ) 32, 29, 31 

29 EDGE(L) = I 
GO TO 32 

31 EDGE(L) = EDGE(L) - 1 

32 CONTINUE 
C = C - 1 
EDGE(K) = I 
GO TO 49 
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36 EDGE(K) = I 
VERTEX(V2) = I 
GO TO 49 
39 M2 = H(K) 

J = VERTEX(V2) 

IF ( J .EQ. 0 ) GO TO 45 
EDGE(K) = J 
VERTEX(VI) = J 
GO TO 49 
45 C = C + 1 
EDGE(K) = C 
VERTEX(VI) = C 
VERTEX(V2) = C 

49 M = M + 1 

50 IF (M .EQ. (N - 1) .OR. K .EQ. E) RETURN 
GO TO 10 

END 


Program 11-3: Fundamental Circuits 


SUBROUTINE FCRKTS ( X , N , NULTY ) 

INTEGER CIRKIT(IOO) , LEVEL(IOO) , P , PRED(IOO) , PREDOP 
1 , TW(100) , X(N,N) ,Z 

NULTY = 0 
DO 5 L = 1 , N 
5 LEVEL(L) = - 1 
NROOT = 1 
7 ITW = 1 

TW(1) = NROOT 
LEVEL(NROOT) = 0 
10 IF ( ITW .EQ. 0 ) GO TO 38 
Z = TW(ITW) 

LVLSUC = LEVEL(Z) + 1 
DO 35 P = 1 , N 
IF ( X(Z,P) ) 35 , 35 , 15 

15 IF ( LEVEL(P) + 1 .NE. 0 ) GO TO 21 
TW(ITW) = P 
ITW = ITW + 1 
PRED(P) = Z 
LEVEL(P) = LVLSUC 
GO TO 33 

21 NULTY = NULTY + 1 
PREDOP = PRED(P) 

M = 1 

CIRKIT(I) = Z 
J = Z 
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26 J = PRED(J) 

M = M + 1 
CIRKIT(M) = J 

IF ( J .NE. PREDOP ) GO TO 26 
M = M + 1 
CIRKIT(M) = P 

PRINT 1000 , NULTY , ( CIRKIT(J) , J = 1 , M ) , CIRKIT(I) 

33 X(Z,P) = 0 
X(P,Z) = 0 
35 CONTINUE 

ITW = ITW - 1 
GO TO 10 

38 DO 39 NROOT = NROOT , N 

39 IF ( LEVEL(NROOT) .EQ. ( - 1 ) ) GO TO 7 

40 RETURN 

1000 FORMAT ( 4H THE I 4 , 24H FUNDAMENTAL CIRCUIT IS ( 20 I 4 ) ) 
END 

Program 11-4: Shortest Distance from stol 

SUBROUTINE DYSTRA ( D , N , S , T , LABELT ) 

C 

C 9999999 IS OUR INFINITY 
C 

DIMENSION LABEL(IOO) 

INTEGER D(N,N) , P , S , T , VECT(100) , Z 
DO 6 L = 1 , N 
LABEL(L) = 9999999 
6 VECT(L) = 0 
LABEL(S) = 0 
VECT(S) = 1 
I = S 

10 M = 9999999 

DO 18 J = 1 , N 

IF ( VECT(J) .EQ. 1 ) GO TO 18 

Z = D(I,J) + LABEL(I) 

IF ( Z .LT. LABEL(J) ) LABEL(J) = Z 
IF ( LABEL(J) .GT. M ) GO TO 18 
M = LABEL(J) 

P = J 

18 CONTINUE 
VECT(P) = 1 

IF ( P .EQ. T ) GO TO 23 

I = P 

GO TO 10 
23 LABELT = LABEL(T) 

RETURN 

END 
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Program 11-5: Shortest Path Between Every 
Vertex Pair 

SUBROUTINE FLOYD ( D , N ) 

C 

C 9999999 IS OUR INFINITY 
C 

INTEGER D(N,N) , S 
DATA INFNTY / 9999999 / 

DO 12 K = 1 . N 
DO 12 I = 1 , N 

IF ( D(I,K) .EQ. INFNTY ) GO TO 12 
DO 11 J = 1 , N 

IF ( D(K,J) .EQ. INFNTY ) GO TO 11 
S = D(I,K) + D(K,J) 

IF ( S .LT. D(I,J) ) D(I,J) = S 

11 CONTINUE 

12 CONTINUE 
RETURN 
END 



|A GRAPHS IN SWITCHING AND 
I Jm CODING THEORY 


The emphasis in the previous chapters has been on introducing more 
concepts of graph theory. Some applications were given, but mainly to make 
the concepts clearer. In the remaining chapters we shall discuss in detail some 
applications. 

In Section 7-5 we saw how the configuration of a switching network 
inside a black box could be determined with the help of graph theory. Again, 
in Section 8-5 a minimal cover of a graph led to the minimization of a 
switching function. In this chapter graph theory will be applied to study 
switching networks further. 

Switching theory came into being with the publication of Paul Ehrenfest’s 
paper in 1910, in which he suggested that Boolean algebra could be applied 
to automatic telephone exchanges. The first mathematical formulation of the 
behavior of a contact network (a particular type of switching network) was 
given by C. Shannon in 1938. Since 1938, switching theory has developed 
rapidly. Originally, it was intended to provide the communications engineer 
with tools for analysis and synthesis of large-scale relay switching networks, 
such as a telephone exchange. In recent years, however, the enormous growth 
of switching theory has been mainly motivated by its use in the design of 
digital computers. 

Unlike the signals in a classical electrical network (say, in a radio receiver), 
switching network signals have only two values—designated as 0 and 1. 
Switching networks are designed to process and store such binary signals. 

A switching network can be classified as either a combinational network 
or a sequential network. A combinational switching network is one whose 
output at a given time depends only on the input at that time. A sequential 
switching network, on the other hand, is one whose output at a given time is a 
function of the input at that time and during its entire past history. In other 
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words, a sequential network has memory, whereas a combinational network 
does not. All digital systems, from the largest multimillion-dollar computer to 
the smallest desk calculator, are constructed from these two basic types of 
circuitry—combinational and sequential. 

A combinational switching network can further be classified as (1) a 
contact network, or (2) a gate network (see [12-5], page 77). It is in the study 
of contact networks that graphs appear as the most natural representation of 
the switching network, as we shall see in the next section. Although attempts 
have been made, little has been accomplished by the use of graph theory in 
gate networks (see Section 9-3 of [1-13]). We shall therefore confine ourselves 
to the contact-type networks in this chapter. 

12-1. CONTACT NETWORKS 

A relay contact (or a contact , for brevity) can be thought of as an ordinary 
household switch used for controlling the light. It is a two-terminal device 
having two states; in the open state there is no conductive path between the 
terminals; in the closed state there exists a path that will allow the electric 
current to flow in either direction. Thus a contact is a bilateral device. 
Usually, a contact is represented by one of the symbols shown in Fig. 12-1. 



Fig. 12-1 Symbols used to represent a switch or contact. 


A contact network is a network of interconnected contacts (see Fig. 12-2 
for an example). Every contact network can be represented by a graph, in 
which the edges are the contacts and the vertices are the terminals. In fact, for 
our purpose, the following is the definition of a contact network: A contact 
network is an undirected, connected graph G (with no self-loops) in which 
each edge has a binary variable x l associated with it, which can assume only 
two values, 1 or 0. The binary variable x t assigned to a contact is 1 when the 
contact is closed and is 0 when the contact is open. 

The input-output behavior of a contact network is usually expressed in 
the form of functions, 

//(*!, * 2 , • • •»**). 

of the binary variables. Such a function/, is called a switching {or Boolean) 
function and must itself assume a value of 0 or L Boolean t (or switching) 

fSwitching algebra, as defined here, is actually a special case of Boolean algebra. 
However, in switching theory these two terms are often used interchangeably, as it causes 
no confusion. 
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algebra, which is used in expressing and manipulating switching functions, is 
defined as follows: 

A Boolean algebra (like rings and fields in Chapter 6) consists of a finite 
set x,, x 2 , . . . , x k and two binary operations + (called Boolean addition) 
and • (called Boolean multiplication) satisfying the following postulates: 


1. Either x t = 1 or x : = 0. 

2. For every x, there exists another variable x\, called the complement of 
x ( , such that if x t — 0, x' t — 1 , and if x, — 1 , x'i = 0. 


3. (a) Sum x. + x ; 
(b) Product x t Xj 


JO, if Xi = Xj = 0, 

[ 1, otherwise; 

J 1, if X; = Xj — 1, 

|0, otherwise. 


With these simple postulates a number of interesting results can be 
derived, which are very useful in the simplification of switching expressions. 
For example, it can be easily shown that jc. + x ( Xj = x r 

In contact networks one encounters two types of problems—the problem 
of analysis and the problem of synthesis. In analysis we are given a contact 
network G and are asked to find conditions under which there will be an 
electrically conducting path between a pair of vertices (v n Vj) in G. In synthe¬ 
sis, on the other hand, we are asked to design (as cheaply as possible) a 
network that can meet the given requirements. We shall deal with the problem 
of analysis first and then with that of synthesis. 


12-2. ANALYSIS OF CONTACT NETWORKS 

Consider any two vertices in a contact network G. Since G is connected, 
there are one or more paths between these two vertices. Each of these paths 
can be identified by the Boolean product of the variables associated with the 
edges in the path. For example, in Fig. 12-2 the eight distinct paths between 
vertices a and b are 


(X,X 5 ), (jfjX'a*,), (x 2 x 3 x 1 ), (x 2 x 3 x' 3 x s ), 

(x 2 x' 1 x 1 l (x 2 x' l x' 3 x 5 ), (x 3 x 4 Xj), (x 3 x 4 x' 3 x s ). 


( 12 - 1 ) 


Each of these products is called a path product between vertices a and b in the 
contact network G. 

Clearly, the value of a path product is 1 if and only if each variable in the 
path product has a value of 1; otherwise, it is 0. The value 1 of a path product 
implies the existence of an electrically conducting path between a and b 
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Fig. 12-2 Contact network with six 
vertices and nine contacts. 


through the corresponding contacts in the network. For an electrical conduc¬ 
tion between the two vertices, it is necessary and sufficient that at least one of 
the path products be 1. In other words, the Boolean sum of all path products 
between a specified pair of vertices (v n Vj) is 1 if and only if the terminals v. 
and Vj are electrically connected in the contact network. Therefore, the 
Boolean sum of path products is referred to as the transmission of the contact 
network between the two specified vertices. For example, the transmission 
between vertices a and b in Fig. 12-2 is 


Fab = *1*3 + *1*3*! + *2*3*1 + *2*3 *3 *5 +*2*1*1 
+ * 2 *'i*3*5 + *3*4*1 + *3*4 *3*5- 


( 12 - 2 ) 


Finding the transmission between specified vertices in a given contact 
network consists of enumerating all paths between the two vertices, and 
finding the Boolean sum of the path products. Furthermore, possible sim¬ 
plifications based on the postulates of the Boolean algebra are also performed. 
For example, in the path products listed in (12-1), the following identities are 
evident: 


*1*3*1 =*i* 3 . 


x 2 x 3 x 3 x 5 = 


* 2 * 1*1 


0 , 

0 , 


and 


x 3 x^x’ 3 x 3 = 0. 


Therefore, the switching function between vertices a and b in Fig. 12-2 is 


Fab = *1*5 + *1*3 + *,* 2 *3 + *1*2*3 * 5 + *,*3*4- (12-3) 

Clearly, F ab gives all different conditions under which a conductive path 
exists between a and b. 
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Normal Form: A switching function can be expressed in many different 
forms. For example, another way of expressing (12-3) is 

F ab = *,(* 5 + x '3 + x 3 (x 2 + x 4 )) + x',x 2 x 3 x 5 , (12-4) 

A Boolean function F(x ,, x 2 , , x m ) of m binary variables x,, x 2 ,, . ., x m 

when expressed as a sum of products (Boolean, of course) of the variables is 
said to be in the normal or natural form. Function F ab in (12-3) is in normal 
form, but in (12-4) it is not in normal form. 

Occasionally, one is interested in finding the transmissions between all 
pairs of vertices in a given contact network G. The result is best expressed as 
an n by n matrix called the transmission matrix T = [t t] ] of G, where n is the 
number of vertices, and t u is the transmission between vertices i and / in G. 
Clearly, T is a symmetric matrix with every diagonal entry t u = 1. 



The transmission matrix for the contact network shown in Fig. 12-3 is 


1 

2 

3 

4 


1 

1 


x, + x 2 x 3 
x 2 + x,x 3 

x\(x 2 + X t X 3 ) 


X, + x 2 x 3 

1 

x 3 + x,x 2 
x' 4 (x 3 + X j X 2 ) 


3 4 

x 2 + x,x 3 x 4 (x 2 + X,x 3 )’ 
x 3 + x,x 2 x 4 (x 3 + x,x 2 ) 
1 x 4 

x 4 1 


The determination of a transmission matrix involves enumeration of all 
paths between every vertex pair in a network. A better method of determining 
a transmission matrix is from the primitive connection matrix, defined as 
follows: 

The primitive connection matrix Q = [q i] ] of an n -vertex contact network 
G is an n by n matrix, whose elements q u are defined as 


q,i = 1 , for all /, 

q u = 0 , if vertices / and j are not directly joined by a contact; other¬ 
wise, 

q u = Boolean sum of the variables associated with all edges directly 
joining vertices i and j. 


The primitive connection matrices for the contact networks in Figs. 12-3 and 
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12 - 2 , respectively, are 





1 

2 3 

4 





1 

" 1 

x, x 2 

0 " 





2 

Xi 

1 x 3 

0 





3 

X 2 

x 3 1 

x 4 

> 




4 

_0 

0 x ' 4 

1 _ 




a 

b 

c 

d 


e 

/ 

a 

~ 1 

0 

X, 

0 


x 2 

•^ 3 ' 

b 

0 

1 

*5 

X, 


0 

0 

c 

x, 

*5 

I 

x ' 3 


0 

0 

d 

0 

X, 

*3 

1 

x'l 

4- x 3 

x 4 

e 

x 2 

0 

0 

x', + x 3 


1 

0 

f 

_*3 

0 

0 

x 4 


0 

1 


The primitive connection matrix is also symmetric, and it contains the 
complete information about a contact network. 

Let Q* be the Arth Boolean power of Q (i.e., Q multiplied by itself k times, 
using the rules of Boolean algebra, as defined in Section 12-1) for some 
positive integer k. Furthermore, let each entry in Q* be simplified, such as 

X + 1 = X + x' — 1, XT' = 0, 
and x + x = xx = x + xy = x. 

Then examine the ijth entry in the simplified Q*. What we have done amounts 
to tracing all edge sequences of length 1 , 2 , . . . , k between the vertices i and 
j; and by employing the simplification process, we have eliminated all 
redundancies, including that of going over the same edge more than once 
(xx = x) and going over the same vertex more than once (x + xy = x). Thus 
the ijth entry in matrix Q* represents all paths of length k or less between 
vertices i and j. Since in an ^-vertex graph the longest path is of length n — 1, 
we have 

Theorem 12-1 

The transmission matrix T of an /7-vertex contact network, with primitive 
connection matrix Q, is given by 


T = Q»- 1 . 

In case one is interested in evaluating the switching function only between 
a specified pair of vertices, Theorem 12-1, which computes the switching 
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functions between all n(n — l)/2 pairs, is wasteful. Theorem 12-2 is more 
efficient: 

Theorem 12-2 

Let Q i7 be the //'th minor of the primitive connection matrix Q (computed in 
Boolean algebra and simplified using Boolean identities). Then the switching func¬ 
tion Fij equals Q, ; . 


Theorem 12-2 can be proved using arguments similar to those which led to 
Theorem 12-1. The details of the proof are left as an exercise. 

Even the evaluation of the minor Q. y is quite laborious and cumbersome. 
A simpler method, called the node-removal method , is often employed in 
evaluating F ij . The interested reader is referred to [12-10], pages 315-323, for 
details on node-removal techniques. 


12-3. SYNTHESIS OF CONTACT NETWORKS 

Designing a network from given requirements is the general problem of 
network synthesis. We can assume that the requirements of the contact 
network to be designed are given in the form of switching functions. (If they 
are given in any other form, they can be converted into switching functions.) 
We can further assume that the switching functions are given in normal form 
(i.e., as a Boolean sum of products). 

Two-Terminal Contact Networks: In a two-terminal synthesis we are given 
just one switching function F(x { , x 2 ,, x m ) of m variables, in normal form, 
and we are to design a network realizing this function as the transmission 
between two of its vertices. 

This problem is trivial if we are not concerned with economizing the 
number of contacts, because any switching function in a natural form can be 
realized by a sufficiently large number of contacts. But such an extravagant 
realization is usually not acceptable. A realization to be useful should contain 
as few contacts as possible. It is this requirement that makes the synthesis 
problem difficult. In Fig. 12-4, for example, are shown three of many possible 
realizations of a very simple switching function (with only four variables). 
The simplest among the three networks is the one in Fig. 12-4(c). We may ask 
if this is the most economic realization possible. If so, how can we be sure? 
Are there any methods that will guarantee our arriving at a most efficient 
contact network for a given function? 

The problem of finding a contact network that realizes an arbitrary 
switching function with a minimum number of contacts has not been solved 
yet (except by exhaustive enumeration) and is not likely to be solved in the 
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(a) Realization with Nine Contacts (b) Realization with Seven Contacts 



(c) Realization with Five Contacts 

Fig. 12-4 Three different realizations of F a b = wx + wy + wz 

+ xyz. 

near future. However, if we consider a restricted type of switching network, 
called single-contact networks, the problem becomes manageable. 

Single-Contact Network: A contact network in which every binary vari¬ 
able x t (either in uncomplemented or complemented form) is associated with 
only one edge is called a single-contact (or SC) network. Thus each contact in 
an SC network can be opened or closed independently. For example, the 
network in Fig. 12-3 is an SC network, but those in Figs. 12-2 and 12-4 are 
not. 

Any transmission that can be realized by an SC network is called a single¬ 
contact function (or SC function). Since in an SC network a variable appears 
only once, it is not possible to simplify the sum of its path products any 
further [the type of simplification performed on expression (12-2) to produce 
expression (12-3)]. In other words, an SC function contains no redundant 
terms. Every product term represents a distinct path between the specified 
terminals, and every literal in a product term corresponds to a distinct edge in 
the path. For example, switching function F — x x x 2 + x 2 x' 3 + x\x 2 x 4 is not 
an SC function. 

Realization of an SC Function: Once we are assured that a given switching 
function F ab is an SC function, we know unambiguously every path between 
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vertices a and b in the network we intend to design. The following procedure 
shows how to design an SC network for an SC switching function. The net¬ 
work is unique up to 2-isomorphism. This is because the set of all paths 
between a pair of vertices specifies a graph uniquely within 2-isomorphism 
(see Section 4-8 and Problem 7-25). 

Procedure for Realizing a Given SC Function of m Variables x t ,x 2 ,..., x m : 

Step 1: From F ab obtain the path matrix P (a>b) with respect to the vertex 
pair (a, b). 

Step 2: Append a column of all l’s to P (a _ ft) . This implies the addition 
of an edge (with associated variable, say, ;t 0 ) between a and b, thus con¬ 
verting every path into a circuit. Let the resulting circuit matrix be denoted 
as B. 

Step 3: Use Jordan’s method of elimination (mod 2) to eliminate all 
dependent rows in B. Rearrange the resulting fundamental circuit matrix B / 
into the standard form (review Section 7-4, if necessary): 

B/ = [I ! B,]. 

Step 4: From B / obtain the fundamental cut-set matrix C f , given by 

C, = [C e ; /„_,] = IB? ! 

Step 5: From C / obtain A f , the reduced incidence matrix, by appropriately 
performing modulo 2 sums of rows in C^. This corresponds to obtaining a 
nonsingular transformation matrix R such that 

A f = R-Cp 

where A f has at most two l’s in each column. In step 5 we are essentially 
taking different ring sums of fundamental cut-sets so that they produce 
sets of edges incident at each vertex. This is the most laborious step in the 
entire procedure, and becomes prohibitive for large graphs (more on this 
later). 

Step 6: Form the incidence matrix A by adding the missing row to A f (so 
that each column has exactly two l’s). From matrix A draw the graph, and 
remove edge x 0 . 

Example: Let us apply this procedure to obtain a graph that realizes the 
SC function 


" ab X x X 1 X i X^X 1 + X 1 X 3 X 4 X 6 + *1 -*5 *6 -^8 T X 2 X 4 

+ x 2 x 3 x 5 x 8 + x 3 x 4 x 6 x 7 x s + x 5 x 6 x 7 . 


(12-5) 
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Step 1: The path matrix is 


x, x 2 x 3 x 4 x 5 x 6 x 1 x 8 


P 


(a,b) 


"1110 
10 11 
10 0 0 
0 10 1 
0 110 
0 0 11 
0 0 0 0 


10 10" 
0 10 0 
110 1 
0 0 0 0 
10 0 1 
0 111 
1110 


Step 2: Appending a column of l’s at the end of matrix P (a>6) and iden¬ 
tifying the column by variable x 0 , we get circuit matrix B: 


x, -X2 x 3 x 4 jx ; 5 x 6 x 7 x s x 0 


B = 


"1110 
10 11 
10 0 0 
0 10 1 
0 110 
0 0 11 
0 0 0 0 


10 10 
0 10 0 
110 1 
0 0 0 0 
10 0 1 
0 111 
1110 


1 

1 

1 

1 

1 

1 

1 


Step 3: This step is somewhat involved. Jordan’s method of elimination 
consists of adding (mod 2) rows to other rows so as to form an identity 
submatrix. For example, adding the first row to the second as well as to the 
third in B we get 


B, 


X, x 2 x 3 x 4 x s x 6 x 7 x 8 x 0 
"1110 10 10 1 “ 
0 10 11110 0 
0 110 0 1110 
0 10 10 0 0 0 1 
0 110 10 0 11 
0 0 110 1111 
0 0 0 0 1 1 10 1 



338 


GRAPHS IN SWITCHING AND CODING THEORY 


CHAP. 12 


In B, adding the second row to the fourth and sixth rows, we get 


X , * 2 * 3 X 4 X 5 X 6 X 7 X 8 X 0 


b 2 


~1 1 101 
0 10 11 
0 110 0 
0 0 0 0 1 
0 110 1 
0 110 1 
0 0 0 0 1 


0 1 
1 1 
1 1 
1 1 
0 0 
0 0 
1 1 


o r 
0 0 
1 0 
0 1 
1 1 
1 1 
0 1 


In this attempt to eliminate all but one 1 in a column (making sure the 1 in 
each column occurs in a different row), we ultimately get 


*1 

X 2 

*3 

*4 

*5 

X 6 

x 7 

x 8 

*0 

“1 

1 

1 

0 

1 

0 

1 

0 

r 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

1 

1 

0 

1 

0 

0 

1 

1 

0 

0 

0 

0 

1 

1 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

_0 

0 

0 

0 

0 

0 

0 

0 

0_ 


From this matrix we get the fundamental circuit matrix in standard form: 


*1 

*4 

*8 

X 6 

x 2 

*3 

*5 

x 7 

x 0 

"1 

0 

0 

0 

1 

1 

1 

1 

r 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

1 

1 

1 

0 

1 

_0 

0 

0 

1 

0 

0 

1 

1 

1 _ 


= [« 4 : bj. 

Now we have the following information about the desired network: 

rank of circuit matrix B = /i = e — w + 1 = 4, 
number of edges (including x 0 ) e = 9, 

Therefore 

number of vertices n = 6, 
rank of the cut-set matrix = n — 1 — 5. 
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Step 4: The fundamental cut-set matrix (with respect to the same tree as 
B f in step 3) is immediately obtained as 


*1 

*4 

*8 

*6 

*2 

*3 

*5 

x 7 

*0 

"1 

1 

1 

0 

1 

0 

0 

0 

0 “ 

1 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

1 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

1 

1 

1 

1 

0 

0 

0 

0 

1 _ 


Step 5: After many trials we find that if we perform the following three 
elementary row operations on C f we get a matrix that contains at most two 
l’s per column. The operations are 


Add (mod 2) row 5 to row 1, 
Add (mod 2) row 3 to row 5, 
Add (mod 2) row 4 to row 3. 

The resulting reduced incidence matrix A f is 


*1 

*4 

X* 

*6 

*2 

*3 

*s 

x 7 

*0 

"0 

0 

0 

1 

1 

0 

0 

0 

r 

1 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

_0 

1 

0 

0 

0 

0 

1 

0 

1 _ 


Step 6: We get the incidence matrix A by adding a row at the bottom such 
that every column now has exactly two l’s. 


x 4 x 8 x 6 x 2 x 3 - X 5 x 7 x 0 

"oooi looor 
10 10 0 10 0 0 
0 0 10 0 0 1 10 

1 0 0 10 0 0 1 0 
0 10 0 0 0 1 0 1 

0 10 0 1 1 0 0 0 


Finally, the required contact network is constructed (see Fig. 12-5) from the 
incidence matrix A, and then edge ,x 0 is deleted. 
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Fig. 12-5 Realization of F a b in expres¬ 
sion (12-5). 


In this six-step synthesis of an SC switching function, we see that steps 
1, 2, 4, and 6 are easy and require no conditions on F ab for their completion. 
Step 3 involves some labor, but it is also guaranteed to terminate. Any matrix 
of rank k can be reduced by Jordan’s process of elimination to one of the 
following forms (see [7-3]): 


[I* ! 0], 


n* 

°] 

_0 

0] 


or 


Realizability: Steps 1, 2, 3, and 4 can always be performed whether or not 
the given switching function F is an SC function. The procedure will fail at 
step 5 if F is not an SC function, and we will not succeed in obtaining a 
reduced incidence matrix by elementary row operations. This leads us to an 
extremely important question in graph theory: When can a given (0, 1)- 
matrix be a cut-set matrix of some graph? An arbitrary (0, l)-matrix M may 
or may not be a cut-set matrix. For example, the matrix 


~1 

0 1 

1 

1 

0 

0 

1 

1 0 

1 

0 

1 

0 

1 

1 1 

0 

0 

0 

1 


cannot be a cut-set matrix of any graph. This can be verified by considering 
all seven possible (mod 2) sums of the three rows, and observing that this 
matrix cannot be transformed by elementary row operations into a matrix 
with at most two l’s per column. In other words, no incidence matrix can be 
found to correspond with L as a cut-set matrix. 

The matrix L is unique. It can be shown that this is the smallest (0, 1)- 
matrix which cannot be a cut-set matrix of any graph. It is also clear that any 
matrix M, if it contains L as a submatrix, cannot be a cut-set matrix either. 

Let us look at another facet of the situation. A cut-set matrix of a graph 
G is also the circuit matrix of its dual G* if and only if G is planar. Suppose 
that matrix M = [M, ; IJ contains a submatrix which we know to be the 
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circuit matrix of some nonplanar graph H; then M cannot be a cut-set matrix; 
otherwise, we have a situation where a nonplanar subgraph has a dual, which 
is impossible. Thus we have a second necessary condition: if a matrix M = 
[M, ! IJ is to be a cut-set matrix, it must not contain the circuit matrix of any 
nonplanar graph. From Theorem 5-9, we know that a graph is nonplanar if 
and only if it has as a subgraph either of the two Kuratowski graphs or any 
graph homeomorphic to either of them. Therefore, if M is to be a cut-set 
matrix, it must not contain a circuit matrix of either Kuratowski graph, or 
any graph homeomorphic to either of them. 

It has been shown by Tutte in a remarkable paper that the two necessary 
conditions discussed so far are also sufficient. The proof of sufficiency is 
extremely long and is based on the theory of matroids. The realizability 
conditions for a cut-set matrix are precisely stated in Theorem 12-3 (for a 
proof see [12-11] or [12-14]). 

Theorem 12-3 

Necessary and sufficient conditions for the (0, l)-matrix M to be a cut-set 
matrix are that 

1. M does not contain L or L r as a submatrix. 

2. M does not contain the circuit matrix of either Kuratowski graph, or any 
graph homeomorphic to either of them. 

Realizability of M as a Circuit Matrix 

Suppose that we want to find whether or not a matrix M = [I* M 2 ] is the 

fundamental circuit matrix (rather than cut-set matrix) of some graph. The 
following result, the analog of Theorem 12-3 and proved by Tutte in the same 
paper, has the answer. 

Theorem 12-4 

Necessary and sufficient conditions for the (0, l)-matrix M, to be a circuit 
matrix are that 

1. M does not contain L or L r as a submatrix. 

2. M does not contain the cut-set matrix of either Kuratowski graph, or any 
graph homeomorphic to either of them. 

Note that an arbitrary (0, l)-matrix M falls into one of four categories: 

1. M is a fundamental cut-set matrix of some graph G and a fundamental 
circuit matrix of another graph G * (graphs G and G* are planar). 

2. M is a fundamental cut-set matrix of some graph G, but is not a circuit 
matrix of any graph (G is nonplanar). 
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3. M is a circuit matrix of some graph G, but is not a cut-set matrix of any 
graph (G is nonplanar). 

4. M is neither a cut-set matrix nor a circuit matrix of any graph. 

12-4. SEQUENTIAL SWITCHING NETWORKS 

So far, we have considered only combinational switching networks. Let 
us now study the sequential switching networks (better known as sequential 
machines t). As pointed out earlier in the chapter, the output of a sequential 
network depends not only on the present inputs but also on their past history. 
A sequential machine must, therefore, be able to retain information about the 
past inputs. This introduces the concept of “state” of a sequential network, 
where the “state” corresponds to the memory of the past inputs. Mathemati¬ 
cally, a sequential machine is defined as follows: 

A sequential machine is a mathematical system M, which consists oft 

1. A finite set V = {v { , v 2 , . . . , v n } of internal states (or simply states ). 

2. A finite set X = (a:,, x 2 , . . . , x m } of inputs called the input alphabet. 

3. A finite set Z = (z 1? z 2 , . . ., z p ] of outputs called the output alphabet. 

4. A function or mapping that assigns to every combination of the present 
state and the present input (v p x y ) a next state v k . This function is called 
the transition function of M. 

5. Another function, called the output function, assigns an output z s to 
every combination (v p x y ) of the present state and the present input. 

There are two equivalent methods of describing a sequential machine: (1) 
a tabular form, called the state table, and (2) a weighted, directed graph, 
called the state graph (or state diagram). 

Each vertex in the state graph corresponds to a state of the sequential 
machine, and each directed edge represents a transition from the present state 
to the next. Every edge (v p v } ) has an ordered pair of weights x k , z q assigned 
to it. This weight pair represents the fact that if the present state of the 
machine is v t and if the present input is x k an output z q results, and the next 
state will be v } . The state table and the state graph of a sequential machine 

tSequential switching networks are also called sequential networks, sequential machines, 
sequential nets, or sequential circuits. The terms finite-state machines and automata are also 
used for sequential switching networks. The form sequential machine is perhaps the most 
commonly employed term and we shall use this term. 

JThis definition of a sequential machine is somewhat restricted. It is the Mealy model 
of a deterministic and completely specified sequential machine. 
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with 

states V = {A, B, C, D], 
inputs A" = {1, 2}, 

outputs Z = {a, b, c} 

are shown in Fig. 12-6. In the state graph the edge with weight pair from ver¬ 
tex A to B, for example, indicates that when the machine is in state A and the 
input 1 is applied the machine produces an output a and will go into state B. 



D 1, c C 


Next state/output 

Present 

state _1_ 2 Inputs 


A 

B/a 

A lb 

B 

C/c 

D/c 

C 

D/c 

B/b 

D 

B/a 

A/c 


Fig. 12-6 State graph and state table for a sequential machine. 


Properties of State Graphs 

The following observations can be made about the properties of state 
graphs: 

1. In response to each specified input the machine in a given present state 
goes into a specific next state. Therefore, the out-degree of each vertex 
is m, one for each input; and the state graph has nm edges. Note that 
there is no similar restriction on the in-degrees. 
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2. Since an input may leave a sequential machine in its present state, 
self-loops may occur in a state graph. 

3. A state graph may also have parallel edges, but they will have different 
weight pairs. 

4. In most cases one of the states of a sequential machine is designated as 
a starting state, and the machine is required to be in this state before 
any input is applied. The state graph of a machine with a designated 
starting state is regarded as a rooted digraph, the root being the 
starting state. 

5. A state (if any) that the machine cannot leave, no matter which input 
is applied, is called a persistent state. If a sequential machine has a 
persistent state, the corresponding vertex will have no directed edge 
going from it to another vertex. 

6. A sequential machine is said to be strongly connected if its state graph is 
strongly connected. Thus a sequential machine M is strongly connected 
if and only if M can be brought to any state from any other state by an 
appropriate input sequence. 

The state graph of a sequential machine contains all the information about 
the machine. Therefore, it is possible to study the properties of a given 
machine by studying its state graph. Some of the problems that arise in the 
theory of sequential machines are 

1. Analysis: In analyzing the behavior of a machine, we may, for instance, 
be interested in determining the response (next states and outputs) of a given 
machine to a certain input sequence. Or we may be interested in drawing some 
conclusion about the internal behavior of a machine by applying a series of 
inputs and observing the outputs. If a machine has a designated starting 
state, the application of a given input sequence results in a unique output 
sequence. 

2. Synthesis: To design a machine having a desired behavior, we start 
with the statement of the desired response and construct a state graph. 
Consider the following example: 

Problem: Design a sequential machine to respond to an arbitrary input 
sequence of 0’s and l’s. The machine should produce an output of 1 when¬ 
ever there appears a set of four consecutive input bits of value greater than 9 
in a serial 8-4-2-1 BCD code (the least significant bit comes to the machine 
first). Whenever the value of a four-bit sequence is 9 or less (i.e., 0000, 0001, 
0010, . . . , 1001), the output should be 0. 

Solution: The machine should store the last three consecutive bits and 
should examine and respond to the next bit. Therefore, we should start with 
an eight-state (2 3 = 8) sequential machine. Let the eight states 000, 001, 010, 
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Oil, 100, 101, 110, and 111 be designated by A , B, C, D, E, F, G, and H, 
respectively. The input alphabet consists of [0, 1], and the output alphabet 
also consists of [0, 1}. 

When a new bit arrives at the left, the machine drops the rightmost bit and 
stores the new bit together with the two old ones. For example, if the machine 
is in state C (i.e., 010) and a 1 arrives, the next state is 101 (i.e., F ) and the 
output is 1 (corresponding to 1010). The state table and the state graph of 
such a sequential machine can be easily constructed and are shown in Fig. 12-7. 
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Fig. 12-7 State table and state graph. 

3. State equivalence and reduction: The eight-state machine we just 
obtained is not necessarily the “simplest” one to perform the specified task. 
The next step, and a very important step, is to examine the state graph and see 
if it can be reduced to a “simpler” machine. The reduction can be accom¬ 
plished if we can determine whether or not two states in a given machine (i.e.. 
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a pair of vertices in the state graph) are equivalent. If two states produce the 
same outputs and also go to a pair of equivalent next states for every input, 
they can be considered as one state and given the same label wherever they 
occur in the state table. 

In the state graphs we can fuse the two equivalent vertices and remove any 
redundant edges (parallel edges with identical weights) that may result from 
the fusion. In Fig. 12-7 vertices A and B are equivalent, and therefore they can 
be fused. This fusing results in the seven-state machine shown in Fig. 12-8(a). 

The process of reduction is shown in Fig. 12-8. When completed it yields 
the state graph of Fig. 12-8(e). State A in Fig. 12-8(e) is the replacement of A 
and B in the original state graph, state C is for C and D, and state E is for E , 
F, G, and H. 

The three-state sequential machine in Fig. 12-8(e) performs the same task 
as the original eight-state machine in Fig. 12-7 did. This simple example 
illustrates the importance of the state-reduction process. 

4. State assignment: The next step is the implementation of a sequential 
machine from the reduced state graph. Assuming that binary memory devices 
(i.e., two-state devices such as flip-flops or toggle switches) are used, an 
n-state machine will require q such devices, where 

2 q ~ l <n< 2 q . 


The q binary memory devices allow 2 q possible states. How to assign n of 
these 2 q states to the n vertices of the state graph such that we get the most 
economical machine is the problem of state assignment. 

In graph theoretic terms the state assignment problem is the same as that 
of labeling the vertices of an /7-vertex digraph with available 2 q (> n) labels, 
with certain optimizing criteria. 

Finding an efficient algorithm to obtain the “best” assignment is an 
important unsolved problem in the theory of sequential machines. Listing all 
possible assignments and then picking out the best is impractical even for 
machines with 10 states. However, for a very small machine, such as the 
three-state machine in Fig. 12-8(e), it is possible to look at all distinct 
assignments and compare them. For n = 3 and q = 2, the number of distinct 
assignments is 3. The following table shows three distinct assignments ( y , and 
y 2 are the two memory devices). 


States 


Assignment I Assignment II Assignment III 


>t yi y i yi y i yi 


A 0 0 0 0 0 0 
C 0 1 0 1 1 1 
E 10 1110 
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If you are familar with logical devices, design these machines completely, 
using flip-flops (or delay lines) and gates. You will notice that one assignment 
is decidedly superior to the other two. 

There are a number of very important but difficult problems in sequential 
machine theory. Graph theory may have potential for solving many of these 
outstanding problems. 

12-5. UNIT CUBE AND ITS GRAPH 

Consider a set of m switching variables jc,, x 2 ,. . ., x m . Each x ( can take a 
value of either 0 or 1. Therefore, we can form 2 m distinct m-tuples. Each of 
these m-tuples can be represented by a vertex of the m-dimensional unit cube. 
Unit cubes for m = 1, 2, and 3 are shown in Fig. 12-9. The extension to 
m > 4, although geometrically difficult, is simple enough to visualize. 

10 11 


•-• 

0 1 

0 . 

00 01 
G 2 




Fig. 12-9 One-, two-, and three-dimensional cubes. 

The edges and vertices of an m-dimensional unit cube form a graph with 
2 m vertices. Each vertex is labeled as a distinct binary sequence of m bits such 
that two vertices are adjacent if and only if they (i.e., their labels) differ in 
exactly one bit. Such a graph is called an m-cube and will be designated by 
Q m . Once again, how we draw the m-cube is immaterial as long as we preserve 
the adjacency relationships of its vertices. For example, Q 3 is drawn in another 
way in Fig. 12-10(a). The 4-cube is sketched in Fig. 12-10(b). The m-cube is of 
interest in studying switching functions of m binary variables. The state- 
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(a) (b) 

Fig. 12-10 Graphs of 3-cube and 4-cube. 


assignment problem, discussed in the last section, can be looked upon as a 
problem of selecting and labeling vertices of an m-cube. See Chapter 13 of 
[12-2]. 

Some observations that can be made about the properties of an m-cube, 
Qnr are 

1. There are exactly m distinct labels that differ from a given label (of m 
bits) in one position. Therefore, each vertex in Q m is of degree m. Thus 
Q m is a regular graph of n = 2 m vertices and e = m- 2 m ~ l edges. 

2. The distance Vj) (i.e., the number of edges in a shortest path) 
between two vertices v. and v } in an m-cube is equal to the number of 
positions in which the labels of v,. and v s differ. For example, in Q 3 
in Fig. 12-9 the distance between (011) and (101) is 2. This distance is 
known as the Hamming distance between the two binary words. It is 
easy to see that 

8{v., Vj) = number of 1 ’s in mod 2 vector sum of the labels of v. and v } . 

3. The maximum distance possible between two vertices in an m-cube is 
m, because two m-bit sequences can differ at most in m positions. 

Subcubes: A ^-dimensional cube can be looked upon as a subcube of 
higher-dimensional cubes. Similarly, graph Q k may be regarded as a subgraph 
of Qm & < m) such that Q k consists of the 2 k vertices (of Q m ), whose labels 
have identical m — k corresponding bits. For example, the vertices (Oil), 
(001), (111), and (101) in Fig. 12-9 have the same last bit, and constitute a 
subcube Q 2 in Q 3 . Each vertex is a 0-cube, and any edge is a 1-cube. 

Minterms: A Boolean product containing each of m variables x x , x 2 , . . . , 
x m exactly once, either complemented or uncomplemented, is called a 
minterm (or canonic product) of m variables. For example, the minterms of 
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three variables = a, b, c are ( a'b'c '), ( ab'c ), {a be), ( a’bc ), ( ab'c '), {ab'c), 
( abc' ), and (#6c). 

There are 2 m distinct minterms of m variables, and they can be put into a 
one-to-one correspondence with the vertices of an m-cube. The minterm 
(x\x' 2 . .. x' m ) corresponds to (0 0 ... 0) vertex, (x'j x' 2 . . . x' m . jxJ corresponds 
to (0 0 ... 0 1) vertex, and so on; finally, the minterm (x x x 2 . . . x m ) corre¬ 
sponds to vertex (1 1 ... 1) of Q m . 

Switching Functions on the m-Cube: Any switching function f{x x ,x 2 ,..., 
x m ) of m variables can be expressed uniquely as a Boolean sum of a subset of 
2 m minterms. This is termed as the canonic form of /. Clearly, the function / 
is 1 at those and only those vertices whose corresponding minterms are 
present in the canonic form of/. At all other vertices the function/is 0. The 
verticest of Q m at which/is 1 are called true vertices with respect to function 
/ and the vertices at which / is 0 are called the false vertices of Q m with 
respect to function/ For example, consider the following function of three 
variables: 


/(*,, x 2 , x 3 ) = x\x' 2 x' 3 + x x x' 2 x' 3 + x'.XjX'a + x',x 2 x 3 . 

The true vertices for this function on Q 3 are shown encircled in Fig. 12-11. 



Fig. 12-11 True vertices on a Q 3 for 
a given function. 


Thus every switching function of m variables uniquely partitions the 
vertices of the graph Q m into two sets, one consisting of the true vertices and 
the other consisting of the false vertices. There are 2 2 ”‘ such partitions^ each 
corresponding to a distinct switching function of m variables. Thus the prop¬ 
erties of switching functions can be determined by studying the properties of 
the subgraph of Q m defined by the true vertices with respect to the given 
function. 

•fThose familiar with Karnaugh map will recognize that a vertex in Q m corresponds to 
a square in Karnaugh map. 

tThis includes two extreme cases when all 2 m vertices are true (i.e.,/— 1), and all 
vertices are false (i.e.,/ = 0). Usually, partitions do not have empty subsets, but here we 
have called these two cases also partitions. 
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Gray Codes: Often, when information is converted from analog form to 
its digital equivalent, one requires a list of distinct binary m-tuples such that 
each differs from the one preceding it in just one coordinate. For example, to 
determine the angular position of a rotating shaft, the angles in adjacent 
quantum intervals are encoded into m-tuples (using m brushes on a com¬ 
mutator) of binary digits that differ in just one place. Taking m — 3, for 
instance, as the angle increases from 0 to 360°, the binary code for angles 
might go through the succession 


and back to 


000 

for 

0-45°, 

001 

for 

45-90°, 

Oil 

for 

90-135°, 

010. 

for 

135-180°, 

110 

for 

180-225°, 

111 

for 

225-270°, 

101 

for 

270-315°, 

100 

for 

315-360°, 

000 

for 

0-45°. 


Such a code, which requires the changing of only one bit at a time, is 
called the Gray code, the reflected binary code, circuit code, or cyclic code. In 
contrast to the Gray code, other codes may require changing of several bits 
when going from one number to the next higher number. For example, going 
from 7 to 8 in 8-4-2-1 BCD (i.e., from 0111 to 1000) involves a change in all 
four bits simultaneously. Because of variations in the construction of the 
equipment, such multiple changes may not register simultaneously. Thus, 
during the change, false code combinations are supplied. Such false code 
words are eliminated in a Gray code, and this is why Gray codes are so 
important in analog-to-digital conversion of information. 

An m-bit Gray code corresponds to a circuit in an m-cube. For instance, 
the 3-bit Gray code just illustrated for measuring the angular position of the 
rotating shaft is defined by the Hamiltonian circuit in Q 3 in Fig. 12-12 shown 
in heavy lines. The reason for the term cyclic or circuit code should be clear 
from this figure. 

An m-bit code that uses all 2 m vertices is called a complete code. A circuit 
code need not be a complete code. For example, when 4-bit words are used to 
represent decimal digits, we use only 10 out of 16 vertices. 
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Snake-in-the-Box Codes: In selecting an incomplete code from 2 m avail¬ 
able words, one would like to select a code that has certain error-checking 
properties. One such code has the desirable property that a single binary error 
(caused by malfunctioning of the equipment) in a word results in either (1) the 
next word, (2) the preceding word, or (3) a word that does not appear in the 
code at all. The last case indicates a detected error, and the first two cases 
introduce errors of relatively small magnitude. Such a code is called a snake- 
in-the-box (SIB) code, or unit-distance error-checking code. 

An SIB code corresponds to a circuit in Q m such that no two nonsucces- 
sive vertices on the circuit are adjacent. A 6-word, SIB code in Q 3 is shown in 
Fig. 12-13. 



The SIB codes can be generalized to codes with additional error-checking 
properties, as follows: In graph Q m , a circuit C s is said to be of spreads if a 
person going around C s cannot find a shortcut (i.e., a path with no edge from 
C s ) between two vertices of C s consisting of fewer than s edges of Q m . With 
this definition, every circuit in Q m is of spread I, and an SIB code corresponds 
to a circuit of spread 2. 

For a given m and a specified s, one would like to find as large a circuit C s 
as possible. At present no relationship is known that gives the size of the 
largest C s in a Q m for arbitrary m and s. For a survey of such problems on 
codes in Q m , the reader is referred to a paper by Klee [12-8]. 

Huffman Graph-Theoretic Codes: We shall now briefly discuss the applica¬ 
tion of graphs to an entirely different type of coding. A binary group code is a 
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set of binary code words with the property that the modulo 2 sum of arty two 
code words in the set is also a code word in the set.t Binary group codes are of 
importance in information transmission, both for analytic and practical 
reasons. The group structure facilitates their mathematical study as well as 
their implementation. For more details on group codes see [12-12]. 

Since the ring sum of two cut-sets in a graph is another cut-set or an edge- 
disjoint union of cut-sets, it is evident that the set of all cut-sets and edge- 
disjoint union of cut-sets can be used to define a binary group code. In other 
words, the vectors (2 r of them, r being the rank of the graph) in the cut-set 
subspace fV s , over GF(2), constitute a binary group code. 

The rows of a fundamental cut-set matrix can be used to generate this 
binary group code. Such a code is called a Huffman graph-theoretic code. 

For example, consider a graph and its fundamental cut-set matrix C f in 
Fig. 12-14. The rows of C f , their modulo 2 sums, and the zero vector yield the 
5-bit, 8-word code shown in Fig. 12-14. 



1 0|1 

0 

0 

1 1 1 0 

1 

1 

0 

1 

0 1 1 0 

0 

1 


r 


(5. 3. 2) code- ■< 


10100 
11010 
01001 
01110 
10011 
11101 
00111 
00000 


Fig. 12-14 Graph and its cut-set code. 

Analogously, the fundamental circuit matrix of a graph also generates a 
binary group code. Thus we have two graph-theoretic codes associated with 
every graph. 

tNote that we are and have been discussing only binary codes and only those binary 
codes in which each code word consists of the same number of bits. Such a code is called 
a uniform binary code or a binary block code. Gray codes and binary group codes are 
examples of binary block codes. 
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A graph-theoretic code is generally specified by three numbers—the 
number of edges e in the graph, the dimension of the associated subspace, 
W s or W r (i.e., rank r or nullity pi), and the smallest number of l’s in a non¬ 
zero code word. Thus the graph-theoretic code generated by the cut-sets of 
the graph in Fig. 12-14 is a (5, 3, 2) code. 

Now that we know how to generate a code (in fact two codes) from any 
graph, we can investigate codes corresponding to important kinds of graphs— 
such as complete graphs, bipartite graphs, regular graphs, and planar graphs. 
Conversely, we can look for graphs that generate group codes with certain 
specified properties, such as efficiency and error-correcting capability. This is 
an area of current research. Some relationships between the properties of 
graphs and the properties of the associated codes have been investigated by 
Huffman [12-7], Frazer [12-3], Hakimi and Bredeson [12-4], and Saltzer 
[12-13]. 


SUMMARY 

In this chapter graph theory was applied to switching circuits, automata 
theory, and coding theory. The applicability of graphs to digital systems and 
signals is not surprising, because both operate in GF(2). 

For lack of space, only selected applications in switching theory were 
discussed. Many related topics, such as the study of series-parallel contact 
networks, planar and nonplanar contact networks, and regular expressions, 
were not even mentioned. Several other topics, such as the graphs of gate-type 
networks, generalized SIB codes, and properties of Huffman codes, were also 
left out. These are some of the areas of current research in switching theory. 
It is hoped that the serious reader will go to the references cited for a fuller 
account of this fascinating application of graph theory. 
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1 ELECTRICAL NETWORK ANALYSIS 

Iv BY GRAPH THEORY 


One of the reasons for the recent revival of interest in graph theory among 
students of electrical engineering is the application of graph theory to the 
analysis and design of electrical networks (more commonly known as elec¬ 
trical circuits). The idea of using graph theory for predicting the behavior of 
an electrical network is not new. It originated with G. Kirchhoff in 1847 and 
was improved upon by J. C. Maxwell in 1892. However, for hand computa¬ 
tions (which were necessarily limited to small networks), the application of 
graph theory to network analysis offered little real advantage over the more 
elementary methods of node or loop analysis. 

The picture has changed and is changing since the arrival of the high-speed 
digital computer. A milestone in graph-theoretic analysis of electrical net¬ 
works was achieved by W. S. Percival, when he extended the Kirchhoff and 
Maxwell methods to networks with active elements. Computer programs are 
now available for analysis of large networks [13-2] based on the graph- 
theoretic approach. More efficient and less user-oriented computer programs 
for analyzing larger and more general types of networks are in the offing. In 
this chapter we shall present the underlying principle of graph-theoretic 
analysis of networks'—which is how to use spanning trees (or chord sets) for 
evaluating determinants of a matrix. 

Reminder on Terminology: Different disciplines using graph theory have 
developed somewhat different terminology. In electrical engineering, the term 
branch is used for edge, node for vertex, and loop for circuit. An electrical 
network is more commonly known as an electrical circuit. For the sake of 
consistency, however, the same graph theory terminology has been used 
throughout this book. 
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13-1. WHAT IS AN ELECTRICAL NETWORK? 

An electrical network is a collection of interconnected electrical elements 
(or devices) such as resistors, capacitors, inductors, diodes, transistors, 
vacuum tubes, switches, storage batteries, transformers, delay lines, power 
sources, and the like. The behavior (such as the response to a unit impulse) of 
an electrical network is a function of two factors: (1) the characteristics of 
each of the electrical elements, and (2) how they are connected together, that 
is, their topology. It is the latter factor that brings graph theory into the 
picture. 

An electrical element can be 

1. Lumped or distributed. 

2. One-port (i.e., two-terminal) or multiport. 

3. Linear or nonlinear. 

4. Time invariant or time varying. 

5. Passive or active. 

6. Bilateral or nonbilateral. 

To avoid using partial differential equations, a distributed element, such 
as a transmission line, is either approximated by lumped elements or is con¬ 
sidered separately. Thus an electrical network almost by definition implies a 
network consisting of lumped elements only. Also, a multiport device such as 
a transformer or a pentode can be replaced by a set of interconnected two- 
terminal elements, such as resistors, inductors, and dependent power sources 
(see Fig. 13-7). Thus we can confine ourselves to a network of lumped, two- 
terminal elements. 

A two-terminal electrical element is represented by an edge e k . Associated 
with each edge are two edge variables , v k (t) and i k (t). The variable v k (t ) is 
called the edge voltage and may be regarded as a cross variable , because it exists 
across the two end vertices of the edge. The other variable i k (t) is called the 
edge current and may be thought of as a through variable , because it flows 
through the edge. Since the variables are directional, every edge is assigned 
an arbitrary orientation (see Fig. 13-1). The characteristics of each element 
are completely described in terms of these two variables. (The physics of an 
electrical element and its mathematical description form another subject in 
electrical engineering and are of little concern to us here.) 

Thus an electrical network for us is a connected directed graph G in which 
each edge e k is assigned two variables v k (t) and i k (t). The edge variables of each 
edge satisfy a relationship imposed by the nature of the corresponding ele¬ 
ment. Let the directed graph G have n vertices 1, 2, 3,...,« and e edges 
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&th electrical branch 


‘k(0 

•-►-• 

+ v k (t) - 


Edge k 


(a) 


(b) 


Fig. 13-1 Electrical element and its representation as an edge of 
a directed graph (the voltage + is always at tail of current arrow). 


b x , b 2 ,... , b e . Let the values of currents flowing through these edges at a 
given time be represented by a column vector (called th e edge-current vector) 
i(/), where 


Ri(0' 


i(0 = 


i 2 (t) 


hi!) J 


Similarly, the edge voltages across the e edges are represented by another 
vector (called the edge-voltage vector ) v(/), where 


v(f) = 


v x {ty 

v 2 (t) 




13-2. KIRCHHOFFS CURRENT AND 
VOLTAGE LAWS 

It was mentioned that each element in an electrical network is governed 
by a specific relationship imposed upon its two edge variables. When the 
elements are interconnected to form a network, is there any additional 
relationship imposed on these edge variables collectively? The answer, as 
every electrical engineer knows, is yes. The edge variables must also obey the 
two laws of KirchhofT’s: 

Kirchhoff's Current Law(KCL): For any lumped electrical network, at any 
time the net sum (taking into account the orientations) of all the currents 
leaving any node (or vertex) is zero. That is, at the rth vertex of the corre¬ 
sponding directed graph (7, we must have 

2 a rk i k {t) - 0, 

k - 1 


(13-1) 
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where a rk is the rkth entry in the incidence matrix A of G, and i k (t) is the 
amount of current flowing through the Arth edge of G. Since Eq. (13-1) holds 
simultaneously for r = 1, 2, it can also be written in the matrix form 

Ai(/) = 0. (13-2) 

Kirchhoff's Voltage Law (KVL ): For any lumped electrical network, at 
any time the net sum (taking into account the orientations) of the voltages 
around a loop (i.e., circuit) is zero. In terms of the corresponding digraph, 
for the rth circuit we must have 


2 b rk v k {t) = 0, (13-3) 

k = 1 

where b rk is the rkth entry in the circuit matrix B of G, and v k (t) is the amount 
of voltage across the Arth edge. Since Eq. (13-3) holds simultaneously for every 
circuit in (7, it can be represented in the matrix form as 


Bv(/) = 0. 


(13-4) 


13-3. LOOP CURRENTS AND NODE VOLTAGES 

Consider the vector space W G (over the field of real numbers) associated 
with the directed graph G. Here G is a connected directed graph of e edges and 
n vertices, representing an electrical network. From Eq. (13-2), we see that the 
edge-current vector i(r) is orthogonal to each of the row vectors in the 
incidence matrix A. Since the row vectors in A span the entire cut-set subspace 
W s (of dimension n — 1), i(/) is orthogonal to W s . Therefore, i(r) lies in the 
circuit subspace W r (of dimension n = e — w + 1) of (7. 

Since i(r) is contained in W r , there must be a set of ju vectors in W T whose 
linear combination will produce \(t). An obvious choice for this set of jj, 
linearly independent vectors in W v is the rows of the fundamental circuit 
matrix B f with respect to some spanning tree. (Clearly, B f is contained in B.) 
Let the coordinates (or coefficients) of i(t) in this basis formed by the rows 
b,, b 2 , . . . , b u of B f be i LX {t), i L2 (t), . . . , i Lu (t). In other words, 


i (0 = 

b-> — 

1 

= [b T M. 

• b^]* 



i 



J-lJL)- 


or \{t) = B} i L (t). 
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Thus each of the e edge currents can be expressed as a linear combination of 
p quantities i Li (t), i L2 {t ),... , i L „(t). These are called loop currents (or mesh 
currents)’, they represent current flowing in the p independent circuits corre¬ 
sponding to the rows of B / . 

Substituting Eq. (13-5) into Eq. (13-2), we get 

(AB*)i L (0 = 0. (13-6) 

Similarly, from Eq. (13-4) we see that the column vector v(t) representing 
the edge voltages is orthogonal to the circuit subspace W r and is, therefore, in 
cut-set subspace W s . Thus v(t) can be expressed as a linear combination of 
the n — 1 rows of the reduced incidence matrix A f . That is, 
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(b) 

Fig. 13-2 Electrical network and its graph. 
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That is, 

v(0 = A T f v N (t). (13-7) 

Thus each of e edge voltages can be expressed as a linear combination of 
n — 1 quantities v NX (t), v N2 (t ),... , v N(n _ v (t). These are called node voltages, 
and they represent the voltage at each of /? — 1 independent vertices with 
respect to the reference vertex. 

Substituting Eq. (13-7) into Eq. (13-4), we get 


(BA/) Vyv (/) = 0. (13-8) 

Let us now illustrate with an example the loop currents and node voltages 
and how they are obtained from the edge currents and edge voltages, respec¬ 
tively. Figure 13-2(a) shows an electrical network with five vertices and seven 
edges. The corresponding directed graph is shown in Fig. 13-2(b). For this 
graph the reduced incidence matrix A f with respect to vertex N 5 and the 
fundamental circuit matrix E^, with respect to the spanning tree (1, 4, 5, 7] 
(shown in heavy lines), are 
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The edge-current vector expressed in terms of loop-current vector is 


i it) = B }\ L (t), 
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The edge voltages in terms of the node voltages (with respect to N s ) are 


v(0 = A T f v N (t), 



13-4. RLC NETWORKS WITH INDEPENDENT 
SOURCES: NODAL ANALYSIS 

In this section we shall restrict ourselves to electrical networks containing 
resistors, inductors, and capacitors (RLC) with independent voltage and 
current sources. In spite of its inherent simplicity, the RLC network covers a 
very large class of electrical networks in practice. In fact, it has been shown 
by Brune and Bott and Duffin that any time-invariant, two-terminal, linear, 
passive electrical element can be formed by a combination of R, L, and C 
(with real positive values of R, L, and C). A further stipulation may be made, 
without any loss of generality, that the voltage sources may only be connected 
in series with RLC elements and that current sources may only be connected 
in parallel with these elements. This stipulation allows us to convert all the 
energy sources either into a set of voltage sources or into a set of current 
sources. 

Nodal Analysis: Consider an RLC network in which all energy sources 
have been converted into current sources. At each node combine all these 
current sources. Let the net current entering from the current sources into the 
rth node be j r (t). For the n — 1 independent nodes, let the column vector 



The n — 1 linearly independent equations from KCL can be expressed as 

A / i(0 = j(0, (13-9) 
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where A f is the reduced incidence matrix of the corresponding graph, and 
i(7) is the e by 1 column vector of currents in each of the e passive edges. 
Taking the Laplace transform of Eq. (13-9), 

A f \{s) = J(j). (13-10) 

But the voltage-current relation in the k\h edge, consisting only of RLC 
elements, is given by 

/*(*) = Y k (s)V k (s), (13-11) 

where I k (s) is the Laplace transform of the current through the kth edge, 
V k (s) is the Laplace transform of the voltage across the fcth edge, and Y k (s) is 
the admittance (or self-admittance) of the kth edge. Writing Eq. (13-11) for all 
the edges in matrix form, 






~y l (sy 

his) 



0 

v 2 (s) 

Jeis )_ 


0 

YXsX 

y.(s)_ 


More compactly, 

!(*) = Y(j)V(j), 


(13-12) 


where I(j) is the Laplace-transformed column vector of the edge currents, 
V(j') is the Laplace-transformed column vector of the edge voltages, and Y (s) 
is the edge admittance matrix. 

Substituting Eq. (13-12) into (13-10), 


AyY(j)V(j) = J (s). (13-13) 

Eq. (13-7) provided a means of expressing the edge-voltage vector in terms of 
the node-voltage vector. Taking the Laplace transform of Eq. (13-7), 


VC*) = A }V N (s), (13-14) 

and substituting Eq. (13-14) into (13-13), 

[A f Y(s)A T f ]V N (s) = J(s) (13-15) 

or Y N (s)\/ N (s) = J(s). 


The (n — 1) by (n — 1) matrix A f Y(s)A} is called the node admittance matrix 
and is written as Y N (s). Note that in deriving Eq. (13-15) it was assumed that all 



x(t) Current source 



.*(/) 




v(t) Voltage 
source 


v(t) 

R 


Fig. 13-3 Passive RLC network and its graph. 







sec. 13-4 


RLC NETWORKS WITH INDEPENDENT SOURCES 


365 


the initial conditions were zero. This too implies no loss of generality, because 
any energy stored in capacitors or inductors at time t — 0 can always be 
replaced by an appropriate energy source and hence incorporated into j(/). 

Let us illustrate these concepts with an example. An RLC network with 
two independent sources—one voltage source and one current source—is 
given in Fig. 13-3(a). Figure 13-3(b) shows an equivalent network with only 
current sources. A directed graph of the network is shown in Fig. 13-3(c). 
The reduced incidence matrix A f (with vertex 4 as reference) is 

a be d e 

i r—i —l o oo 

A f = 2 0 11 10- 

3 L o 0 0 —1 1_ 


The edge admittance matrix is 



The node admittance matrix Y N (s) = A f Y(s)Af is 


~YJLs) + Y b (s) —Y b (s) 0 

YM= —Y b (s) T^)+T c (-y)+W -Y&) 

0 -Y d (s) Y d (s)+Y e (s\ 


J(s) column vector for this example is 


J (J) = 


X(s) 

V(s) 

R 


Network Analysis Problem: Let us pause for a moment and focus on the 
problem that we are solving. The general problem of network analysis can be 
formally stated as follows: Given a network whose structure determines 
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matrix A, given its edge admittance matrix Y(s), and given the current source 
vector J(j), find the node voltages. [If edge voltages or edge currents are 
required, they can be readily obtained using Eqs. (13-7) and (13-12).] 

This clearly requires solving Eq. (13-15), which involves inversion of the 
matrix Y N (s). Inversion of a matrix (which must be nonsingular, of course) 
requires computation of its determinant and of all its cofactors. The conven¬ 
tional determinant technique is inefficient because of extra labor involved in 
computing many terms that eventually cancel out. Moreover, the entries in 
Y N (s) consist of polynomials in s, and must be carried in literal form until 
after the matrix inversion. Therefore, the usual methods of matrix inversion 
are computationally difficult to implement. 

Both these problems are circumvented by using graph theory to evaluate 
the determinant and cofactors. For this we invoke the Binet-Cauchy theorem 
(Appendix A) and use the fact that a major determinant (or simply major) of 
the reduced incidence matrix A f is nonzero if and only if it corresponds to a 
spanning tree. 

Determinant of the Node Admittance Matrix: Let us denote by A N the 
determinant of the node admittance matrix Y N (s). That is, 

A n = det Y N (s) = det[AyY(j)A51. 

Using the Binet-Cauchy theorem, 

A n = sum of products of all pairs of corresponding 

majors of [AyY(j)] and A/. (13-16) 

Had every branch in the network been a 1-ohm resistor, Y (s) would be an 
identity matrix and det Y N (s) would equal det (A f A}), which is equal to the 
total number of the spanning trees in the network (Chapter 9). But for an 
RLC network, in general, Y(s) is not an identity matrix. It is, however, diago¬ 
nal, and therefore A f Y{s) has the same structure as A f except that the kth 
column in A f is multiplied by Y k (s). Every nonzero major determinant in 
A f Y(s), as well as A f , still corresponds to a spanning tree of the network. 

If we call the product of all n — 1 edges of a specific spanning tree a tree 
admittance product, Eq. (13-16) becomes 

A n — Yj tree admittance product. (13-17) 

all spanning trees 

Equation (13-17) was proposed by Maxwell and hence is known as Maxwell's 
formula. To calculate the node admittance determinant by Maxwell’s formula, 
one must find all the spanning trees of the network, multiply the n — 1 edge 
admittances of each spanning tree, and then add the resulting products. Let us 
illustrate Maxwell’s formula for the network of Fig. 13-3. The spanning trees 
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of this graph are abd, abe, acd, ace , ade, bed , bee, and bde. Multiplying the 
edge admittances in each spanning tree and adding them, we get the determi¬ 
nant of the node admittance matrix A N : 


A 


N 


Y£s)Y„(s)Y d (s) + nwnwnw + nwnww + wnwnw 
+ tow + nwnwuw + wm + ww/i) 

i c ' s + zb)( C2S + zb)zb + ( c,s + zbX 02 * + 


+ ( c,s + ib)r; + ( c,s + zb )bf + ( C|i + zb) 
+ ( C2S + zb)z: +1 02 - 5 + zb)bf s + ( C2i + zb) 


1 


RL 3 s 


IRL,s 


^(C,C 2 + C,Cj ) C 2 C 3 ) + J-(C,C 2 + C,C 3 + C 2 C 3 ) 

I (C 2 + c, c, + cy c, + CA I (C 2 + C, C, -+ C 3 \ 

+ ~R\ L, + L 2 + ~L, > + iZ^V Z., + L 2 j 

, ' ( ' , _!_ , M , 1 

h s 2 «U,L 2 ^ L 2 L 3 " r L 3 L,) + s 3 L,L 2 L, 


Note that to compute A N we do not need to write Y^s). Also note that no 
terms are canceled in this method of computing A N . The reader is urged to 
compute det Y^C?) directly from matrix Y N (s) and verify that it equals the 
expression for A N just obtained. Observe the large number of terms that 
cancel in the process of directly evaluating det Y N (s). Also note that A N is 
independent of the reference vertex chosen because the trees of a graph do not 
depend on the reference vertex in writing A f . 


Cofactors of Y iV (s) and 2-Trees 


Evaluation of cofactors of the node admittance matrix Y^s) is slightly 
more involved than det Y N (X). Let the cofactor of the ijth entry in Y N (s) be 
designated by A u . Then by definition 

A - /( 1 /+y ) /determinant of (n — 2) by (n — 2) submatrix of Y^iA 

\ / \left after deleting its ith row and y'th column. / 

Since Y N (s) = A f Y(s)A}, deleting the ith row from A f and they'th column from 
A} will delete the /th row andy'th column from Y^s), respectively. Moreover, 
deleting the y'th column from A} is equivalent to deleting the j'th row in A f . 
Therefore, 

A,-, = (-1 ) i+J det [Af.yisXAf-jYl (13-18) 

where A^,. denotes the submatrix of A f remaining after its /th row has been 
deleted. 
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If A f is the reduced incidence matrix of a graph G, what does matrix A f _ ( 
represent? Matrix A r _,. is the reduced incidence matrix of the graph G, 
obtained from G by fusing its /th vertex with the reference vertex and removing 
any self-loop resulting from the fusion (Problem 13-14). 

Let us first evaluate symmetric cofactors A u , which according to Eq. 
(13-18) is 

A,. = det[A / _ i .Y(5)(A / _,.f ], 

and the right-hand side of this equation is simply the sum of the tree admit¬ 
tance products for the graph G r Therefore, 

A u = sum of tree admittance products for all (13 19) 

spanning trees of G r 

Now look at a spanning tree of G ( . as a subgraph of the original graph G. 
This subgraph has n — 2 edges, n vertices, and no circuits. Therefore, it must 
consist of two components (one of which may possibly be an isolated vertex). 
Such a subgraph is called a 2-tree of G. For example, in Fig. 13-4 the subgraph 
ad is a spanning tree of G 3 and is a 2-tree in G. (Note that G 3 is obtained by 
fusing vertex 3 to the reference vertex 4 and removing the resulting self-loop 
of edge e.) 



4 (3.4) 

G Gj 

(a) (b) 

Fig. 13-4 Spanning tree of Gj is a 2-tree (3, 4) of G. 

Moreover, in this 2-tree of G, the vertex / and the reference vertex r must 
occur in different components; otherwise, fusing them would yield a circuit. 
Such 2-trees in which two specified vertices occur in different components are 
designated by 2-tree (/, r ). For example, in Fig. 13-4 subgraph ad is a 2-tree 
(3, 4). Thus Eq. (13-19) can be rewritten as 

A„- = sum of admittance products of all 2-trees (/, r ). (13-20) 

Let us now use Eq. (13-20) to evaluate the cofactor A 33 of the network in 
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Fig. 13-3. It has five 2-trees of (3, 4) type, and these are, as seen from Fig. 
13-4(b), 

ab, ac, ad, be, and bd. 

Therefore, 


k 3 3 


Y a (s)Y b (s) + Y a (s)Y c (s) + Y a (s)Y d (s) + Y b (s)Y c (s) + Y b (s)Y d (s) 

s 2 (c l c 2 + c,c 3 + c 2 c 3 ) + ( C] + C3 + -- ' - £ - c - 2 + ° 2 + C3 ) 


+ 


-f— 

s 2 \L i L 2 


+ 




) 


To evaluate A u , the cofactor of an off-diagonal entry, observe that in Eq. 
(1-3-18) the nonzero majors of A f -iY(s) correspond to 2-trees (/, r ), where r is 
the reference node. The nonzero majors of A f _ } correspond to 2-trees (j, r ). 
The terms that contribute to A u in Eq. (13-18) must be due to both 2-trees of 
(/, r ) and (j, r ). Since a 2-tree has only two components and vertex r must be in 
one of the components, both i and j vertices must be in the other. Such a 
2-tree is designated by a 2-tree (ij, r ). Thus 



±(-l)' +y 


/sum of admittance products of 
\all 2-trees (ij, r). 


(13-21) 


In Eq. (13-20) we did not have to worry about the sign of the nonzero 
majors, because corresponding majors of both Ay.,, and (A / _ i ) r had the same 
sign. The situation in Eq. (13-21), however, is different. Since A f _; and Ay_ ; 
are different matrices, we have no assurance that the signs of the products of 
the corresponding majors will be positive. In fact, it can be shown (Problem 
13-15) that 


product of corresponding nonzero majors of Ay_ ( . and A f _j = (—1) /+/ . 

(13-22) 

Therefore, 


A u = sum of admittance products of all 2-trees (ij, r ). 
Returning to the example of Fig. 13-3, once more 

A 31 = sum of admittance products of all 2-trees (31, 4) 

= nww=f* + t4-ti ■ 

i ~/ 3 l / 21 > 3 A 


A 32 = Y a (s)Y d (s ) + Y b (s)Y d (s) 

+ C 2 , 1 


(13-23) 


1 
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Node Voltages: Now we can compute any node voltage required. For 
example, the voltage at node 3 in Fig. 13-3 is given by 


V 3(5) = 


A 3i X(s) + A 32 X(5) 4 A, 


Network Functions: Now that we have formulas for the determinant and 
every cofactor A u of the node admittance matrix, any network function that 
was originally expressed in terms of node admittance matrix can now be 
expressed in terms of various tree-admittance products. For example, the 
open-circuit transfer function of a three-terminal network in Fig. 13-5(a) 
(all driving currents zeroed except/,), taking 4 as the reference, is 


H( \ _ A 3 ,( 5 ) __ XI 2-tree (13, 4) admittance product 

~ V,(s) ~ A n (s) — Yi 2-tree (1,4) admittance product 


Formulas like these are called topological formulas for networks. 



(b) 

Fig. 13-5 Three-terminal RLC network. 
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Applying this topological formula to the network in Fig. 13-5(b), which is 
the same as the network in Fig. 13-3 with its driving sources removed, we get 


H(s) = 


_ Y b (s)YJjs) _ 

y b (s)Y d (s) + Y b (s)Y e (s ) + Y c (s)Y d (s ) + Y£s)Y.{s) + YJts)Y.(s) 




L,C,R) ^ L,C 


13-5. RLC NETWORKS WITH INDEPENDENT 
SOURCES: LOOP ANALYSIS 

In Section 13-4, had we considered KVL instead of KCL (converting any 
current source into an equivalent voltage source), we would have obtained a 
set of p = e — n + 1 simultaneous loop equations, 

B f Z(sW f \ L (s) = E(s), (13-24) 

where B f is the fundamental circuit matrix of the network with respect to 
some spanning tree, and B£ is its transpose. The e by e matrix Z(j) is the edge 
impedance matrix, describing the electrical property of each of e edges in the 
network; that is, 


'v,( s y 
v> CO 


-Z,(s) 

Z 2 (s) 

0 

h(s) 

v,(sy 


0 

Z e (s)_ 

Ms). 


Note that for an RLC network the edge impedance matrix Z(s) is the inverse 
of its edge admittance matrix Y(s). I L (^) is the Laplace transform of the loop 
current vector i L (/), and E(^) is the Laplace transform of the voltage sources 
(or equivalent voltage sources) applied externally in the ju fundamental 
circuits. 

The step-by-step derivation of Eq. (13-24) is similar to the derivation of 
Eq. (13-15) and is left as an exercise (Problem 13-9). 

The by ju matrix ^ f Z{s)^> T f in Eq. (13-24) is called the loop impedance 
matrix and is usually denoted by Z L (^). Thus Eq. (13-24) is rewritten as 


Z L (sUs) = E (5). 


(13-25) 
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Li 



(b) 


Fig. 13-6 Network of Fig. 13-3(a) for loop analysis. 


For example, consider the electrical network of Fig. 13-3(a), once again. 
By replacing the current source x(t ) with an equivalent voltage source, we get 
the network as shown in Fig. 13-6(a) and its graph as in Fig. 13-6(b). 


Z L (s) = 


(r 


LjS 


T~ C x L x s z 


+ ... +_L\ (__L\ 

^ 1 + C 2 L 2 s 2 + C 3 s) V C,s) 


(-i) 


(<5 + + R )_ 


AliC 5 ) 


-I 

r x(s)l 2 s I 

> and E(j) = 

1 + L 2 C 2 s 2 

- 

_ ~V{s) _ 


The solution of Eq. (13-25) requires obtaining the determinant and cofactors 
of Z L (s). The expression for A L , the determinant of Z L (.s), according to the 
Binet-Cauchy theorem is given by 

A l = det Z L (s) = det^Z^BJ) 

= sum of products of all pairs of corresponding (13-26) 
majors of [B r Z(^)] and B£. 
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Since a major of B / is nonzero if and only if it corresponds to a chord set, 
Eq. (13-26) becomes 

A l = sum of chord impedance products for all ^ 

spanning trees of the network. 

Equation (13-27) was originally given by Kirchhoff for a purely resistive 
network. For the network of Fig. 13-6(b), all possible chord sets are ce, cd, 
be, bd, be, ae, ac, and ad. Therefore, 


Al = Z c (s)Z e (s ) - 4 - Z£s)ZJis) + Z b (s)Z e {s) + Z b (s)Z d (s) 

+ Z b (s)Z c (s) + Z a (s)Z e (s) + Z a {s)Z d (s) + Z a (s)Z c (s ) 

— _i_ Lla i RL 2 s L 2 L 3 s 2 RL x s 

C 3 s^C 3 ^ 1 + C 2 L 2 s 2 ^ 1 + C 2 L 2 s 2 ^ 1 + C x L x s 2 

4_ ■ _|_ Lz _i L x L 3 s 2 

+ C 3 ( 1 + C x L r s 2 ) + C,( 1 + C 2 L 2 s 2 ) + 1 + C x L x s 2 

The expressions for the cofactors of Z L (s) both symmetrical and asym¬ 
metrical can be obtained in a fashion similar to those for Y^) (Problems 
13-11 and 13-12). 

Note the duality between the nodal and loop analyses (Problem 13-16). 


13-6. GENERAL LUMPED, LINEAR, 

FIXED NETWORKS 

Topological formulas for A N , A L , A u , and so on, derived in the last two 
sections were dependent on two important restrictions on the network: 

1. Existence of edge admittance matrix Y(^) [or edge impedance matrix 
Z(j)], which implied that the network elements were lumped, linear, 
and time invariant. 

2. The edge admittance matrix Yfy) [and therefore also Z(s)] was diagonal. 
This implied that there was no mutual coupling between edges of the 
network. Thus three- or four-terminal devices (which produce couplings 
between two vertex pairs), such as transformers, transistors, tubes, and 
gyrators, could not have been included. 

In this section we shall still retain restriction 1, but do away with 2. This 
will allow us to handle a general linear network containing lumped, linear, 
time-invariant, r-terminal (r > 2) elements—passive devices like transformers 
(which are bilateral also) and gyrators (which are nonbilateral), as well as 
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(b) 


a b c d 



(c) 

Fig. 13-7 (a) Network with a transformer and a transistor; (b) Its 
equivalent network; (c) Graph representation of (b). 

active devices, such as tubes and transistors. An example of such a network is 
shown in Fig. 13-7. 

In the network in Fig. 13-7 (which has six edges and five vertices), we 
observe that the current through edge 5 is dependent not only on V s , but also 
on V 4 , the voltage across edge 6. Similarly, the currents through 1 and 2 are 
dependent on the voltage across each other. Thus edges 1 and 2 are mutually 
coupled and so are 5 and 4. (Edges such as 3 and 6 that have no coupling with 
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any other edge are called ordinary edges.) The edge admittance matrix Y(s) is 
shown in the following equation, l(V) = Y(s)V(.s) for the network: 


where 


rwi 



Y 1 2 ; 
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and 

Y 6 

= gc + c c s. 


Clearly, Y(s) is not diagonal. 


Node Admittance Matrix 


Just as in Section 13-4, Kirchhoff’s current law in its Laplace transformed 
form will yield 


A f Y(s)A T f V„(s) = J N (s), 


and therefore the node admittance matrix is 


Y n (s) = A f Y(s)A}. 


(13-28) 


The difference between Eqs. (13-15) and (13-28) is only that in (13-15) matrix 
Y(^) was diagonal, whereas it is not diagonal in Eq. (13-28). 

For the network and its graph shown in Fig. 13-7, 



a 

b 

c 

d 


1 2 
T 0 
0 1 
0 0 
0 0 


3 4 5 
0 0 0 
1 0 0 
-1 10 
0 0 1 


6 

0 “ 

0 

1 

-1 


and the node admittance matrix Y N (s) = A f Y(s)A} is 

Y , Y n 0 0 

v , . ^12 Y 2 + Sb —gb 0 

Y/v(s) = 

0 -g b g b +Yt + Y 6 -Y 6 

_ 0 0 - y 6 


Gl + Yj 
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Determinant A N 

Again, our aim is to evaluate the determinant and cofactors of the node 
admittance matrix Y^O). We write 

A n = det Y^s) = det A f Y(s)A T f . 

Using the Binet-Cauchy theorem, 

A n = 2 products of corresponding majors of A f Y(s) and A} 

= £det[A / Y( J )]..det[A / ]., ‘ } 

a 

where the subscript a denotes a set of n — 1 columns of A f Y(s) and A f (same 
as a set of n — 1 rows of A}). Thus a also denotes a set of n — 1 edges of the 
corresponding graph. In Eq. (13-29), Y(s) is not diagonal; therefore, the prod¬ 
uct A f Y(s) is not as simply related to A f as it was in Eq. (13-15). So we apply 
again the Binet-Cauchy theorem to evaluate det [A / Y(j , )] a . And since 

[A ,Y(j)]. = A f [Y(s)] a , 

we get 

det [A,Y(*)]. = £ det [A,], • det [Y(j)]£, (13-30) 

p 

where [A f ] p is a set of n — 1 columns of A f and [Y(j)]£ is the corresponding 
set of n — 1 rows of [Y(s)] a . Thus [Y(j)]£ is an (n — 1) by (n — 1) submatrix 
of Y(j). Substituting Eq. (13-30) into (13-29), we get 


An = 2 (S det [A/]/}*det [Y(j)]f^ det [A^. 
= 2det[A / ]..det[A / ] / ,.det[Y( J )]f. 

a, P 


(13-31) 


In Eq. (13-31) the summation is over all possible pairs of sets of n — 1 
edges of the graph, but detJA^ and detfA^ are zero unless a and P corre¬ 
spond to spanning trees of the network, in which case they are +1 or — 1. 
Therefore, 

A*, = 2 6 *p det [Y(a)] a ,/} (13-32) 

a, p 


summed over all possible spanning tree pairs (a, P). The term e aP is the 
product of the signs of spanning trees a and p. 

In general, a and P can represent different spanning trees. If Y(s) is 
diagonal, det [Y(s)]£ = 0, unless a = /?. But if a = p, e a/) = 1 and Eq. 
(13-32) reduces to (13-17). 

But if Y (s) is not diagonal, a spanning tree a, besides making a tree pair 
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with itself, may be able to “pair-up” with some other spanning trees. These 
terms will be contributions to A N due to the couplings between edges. 

Pairs of Spanning Trees: The following method of picking out all pairs of 
spanning trees (a, fi) for which det [Y(V)]£ =£ 0 depends on the fact that for 
any (lumped, linear, time-invariant) electrical network the edge admittance 
matrix Y(s ) can be expressed as 


Y(j) 


Y fs) 


Y 2 Cs) 


0 


0 




I 

I 


Y*(s) 


1 

l 

i 


where the nonzero submatrices Y,(j), Y 2 (s), . . . , Y h (s) are relatively small 
square matrices. See, for example, the edge admittance matrix of the network 
in Fig. 13-7. 

Assuming that we have the list of all spanning trees of the graph, the 
following principle determines which spanning tree pairs with a given 
spanning tree a, such that det[Y(.s)]£ ^ 0. 

The set of rows f must be selected such that [Y(.y)]£ contains no row or column 
entirely of zeros. Therefore, if a contains edge p, and column p in Y(.y) contains 
nonzero entries in rows x, y, . . . , then must contain one (or more) of the 
edges x, y, . . . . Thus, if column p falls in the submatrix Y fe (j) of Y(s), at least 
one of the rows must also be in the submatrix Y k (s). 

A corollary of the observation just made is that if spanning tree a contains 
an ordinary edge u, /? must also contain that ordinary edge u. 

Let us illustrate the selection principle by means of the example of the 
network in Fig. 13-7. The graph has eight spanning trees: 

(1,2, 3, 5), (1,2, 3, 6), (1,2, 4, 5), (1, 2, 4, 6), (1,2, 5, 6), 

(1,3, 4, 5), (1,3, 4, 6), (1,3, 5,6). 

Since 3 and 6 are ordinary edges, the following five are the only candidates 
for possible pairings out of the total of (8 x 7)/2 = 28 pairs of spanning trees: 

1. {(1, 2, 3, 5) and (1, 3, 4, 5)}: both have edge 3. 

2. {(1, 2, 4, 6) and (1, 2, 5, 6)}: both have edge 6. 

3. ((1, 2, 3, 6) and (1, 3, 4, 6)}: both have 3 and 6. 

4. {(1, 2, 3, 6) and (1, 3, 5, 6)}: both with 3 and 6. 

5. {(1, 3, 4, 6) and (1, 3, 5, 6)}: both with 3 and 6. 
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The existence of the same set of ordinary edges is only a necessary and 
not a sufficient condition for pairing. 

Let us now apply the tree-pair selection principle to nonordinary edges: 

1. If Y k (s) is a 2 by 2 square submatrix, it corresponds to a transformer or 
a gyrator, and its contribution to A N is 


Y*(*) 


a b <— row p 
c d <— row (p 1) 


column p column (p + 1) 


or or 


If a Contains 

Column p 

' Column (p + 1) 

Both p and (p + 1) 
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(P + 1) 

i p | (p +1) 
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• i 
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A n due to Y fc (s) is 
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1 C 1 d 

1_1_ 
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or or 


2. If Y^) is a 2 by 2 triangular matrix, it corresponds to a transistor or a 
vacuum tube. In that case 


Y*(s) 


a 0 <— row p 
c d <— row (p + 1) 


column p column (p + 1) 


or or 


If a Contains 

Column p 
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An due to Y k (s) is 
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d 

1 

j ad 


or 


In light of these two tables, let us look at the five tree pairs that are 
possible candidates in the network of Fig. 13-7. 

Three of the five pairs, 1,3, and 4, do not form valid tree pairs, because in 
each of the three one spanning tree contains both edges 1 and 2, while the 
other one contains only edge 1. 
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The remaining two pairs [(1,2, 5, 6), (1, 2, 4, 6)] and [(1,3, 5, 6), (I, 3, 4, 
6)] satisfy the tree-pair-solution criterion, and their contributions to A N are 

{Y X Y 2 -Y\ 2 )g m Y 6 and Y x g b g m Y 6 , 

respectively. 

The criterion of selection of pairs of spanning trees can be easily extended 
to Y k (s) of sizes larger than 2 by 2 (see [13-7]). 

Signs of Tree Pairs: In the case of a spanning tree a consisting of ordinary 
edges only, the spanning tree pairs only with itself, and we need not know if 
det [Ay]* = +1 or — 1 because 

6 aa = det [Ay] a -det[Ay] a = +1. 

But for tree pairs (a, fi) consisting of nonordinary edges (and therefore 
a ^ fi), we must know the relative (not absolute) signs of the spanning trees 
in each pair. 

According to the method of sign determination discussed in Chapter 9, 
for the tree pair in Fig. 13-7, 

[(1,2, 5,6), (1,2, 4, 6)], €a , = +1, 

and for 

[(1,3, 5, 6), (1,3,4 6)], c a/ , = +1. 

Thus A n as a sum of the eight spanning admittance products (a, a pairings) 
and two additional terms due to (a, fi) pairings is expressed as 

Ayv = (YJ 2 - Y\ 2 )g b G L + {Y x Y 2 - Y\ 2 )g b Y 6 + (Y X Y 2 - Y\ 2 )Y,G L 
+ (Y x Y 2 - Y* 2 )Y 4 Y 6 + (Y x Y 2 - Y\ 2 )G l Y 6 + Y lgb Y 4 G L 
+ Y x g b YJ 6 -F Y lgb G L Y 6 + {Y x Y 2 - Y 2 l2 )g m Y 6 + Y lgbgm Y 6 . 

Note once again that there is no cancellation of terms. 

The derivation of cofactors A u for active networks can be carried on 
similarly by a combination of the technique discussed in Section 13-4 and the 
use of spanning-tree pairs. 


SUMMARY 

The technique developed in this chapter can be extended to solve any 
linear-system problem. Roughly speaking, any linear-system problem can be 
expressed in the following form: 


AX* - Y*, 
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where A is a linear operator, Y* a known vector, and X* an unknown vector 
for which the solution is sought. 

A standard method of solving this equation is to find an operator A -1 
(assuming it exists and is unique), the inverse of A, and then to premultiply 
both sides to obtain the required vector 

X* = A-‘Y*. 

In an electrical network consisting of lumped, linear, time-invariant 
devices, the problem consists of solving a set of simultaneous, linear, dif¬ 
ferential equations with constant coefficients. Application of the Laplace 
transform converts these differential equations into linear algebraic equations. 
Thus the operator A is a matrix whose entries are functions of 5, the Laplace 
variable, and Y* is the vector of independent driving voltages (or currents). 

Thus the electrical network problem (like most linear-system problems) 
consists of matrix inversion, which is the same as finding the determinants and 
cofactors. And all that has been done in this chapter is to show how graph 
theory can be used (rather than algebra) to evaluate determinants and 
cofactors of the nonsingular matrix A, if A could be expressed as a triple 
matrix product 

A = PMP r , 

where P is a unimodular (0, l)-matrix—a reduced incidence (or fundamental 
cut-set or fundamental circuit matrix) of a graph—describing the “structure” 
of A; and M is a matrix describing the values of the nonzero entries in A. 

The same approach can be used for solution of any lumped, linear, time- 
invariant system, provided a “system graph” can be found. This has a direct 
bearing on the realizability problem discussed in Section 12-5, as to when a 
given unimodular matrix P can be the cut-set or circuit matrix of a graph. 

Whether there is any computational advantage in using graph theory for 
network analysis is totally dependent on whether one can generate all 
spanning trees, 2-trees, and the like, of a large graph rapidly and without 
duplication. A graph of moderate size (20 vertices and 50 edges) could have 
several million spanning trees. Even the storing of all the trees in a computer 
memory can be a problem. The algorithm should therefore be such that 
spanning trees are rapidly generated, one at a time, and its admittance 
product is added to or subtracted from (depending on the sign) the cumula¬ 
tive sum. The algorithm should guarantee that no spanning tree will be 
generated twice, so that one does not have to check every newly obtained tree 
against all the trees previously generated. Moreover, the algorithm must also 
guarantee that no spanning tree in the graph is left out. 

As discussed in Chapter 11, a number of algorithms for generating all 
spanning trees of a graph have been proposed in the literature. The best ones 
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do generate one spanning tree at a time without duplication and generate all 
spanning trees. But the algorithms are still not as efficient as one would like 
them to be. 
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PROBLEMS 

13-1. Show that Kirchhoff’s voltage and current laws imply “conservation of power.” 
[Hint: Using Eqs. (13-5) and (13-7), show that 2*=i v*(t)i/e(0 = 0.] 
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13-2. An electrical network with e edges has 2e unknowns (the current through and 
voltage across each edge). Identify the 2e independent equations, and discuss the 
existence and uniqueness of the solutions. 

13-3. Kirchhoff’s current law may be expressed in more general form as follows: The 
net sum (taking into account the orientations) of all currents flowing across a 
cut-set is zero. Using a cut-set matrix (instead of incidence matrix) and this form 
of KCL, develop equations parallel to Eqs. (13-2), (13-7), (13-8), (13-15), and 
(13-17). 

13-4. In Fig. 13-2(b) list all spanning trees and all 2-trees (N 3 N U N s ). 

13-5. In Fig. 13-4(a) sketch all 2-trees (2, 4) and all 2-trees (23, 4). 

13-6. Of Kirchhoflf’s and Maxwell’s formulas, which one will you prefer for evaluating 
A n and A t ? Why? 

13-7. In Fig. 13-2, assume the resistance value /?,• or capacitance value C,- (as the case 
may be) in the /'th edge of the network. Let x(t) be the value of the independent 
voltage source shown, and let N s be the reference node. Convert x(t) into an 
equivalent current source in parallel with R i. Use Maxwell’s formula to evaluate 
A N and A 31 . Using these two quantities, evaluate the voltage at node N 3 . 

13-8. In Problem 13-7, keep the voltage source in series with R 1 . Write the loop- 
impedance matrix Z(s) of the network. Write Z L (s). Evaluate A L using Eq. (13-27). 
Evaluate the appropriate cofactor of Z L {s) required for obtaining V2(s). Finally, 
obtain Vz(s), and compare the result with that of Problen 13-7. 

13-9. In a step-by-step fashion derive Eq. (13-24). 

13-10. For an RLC network, prove that 

^ = y,u)y 2 u)... Ye(s). 

[Hint: Yi(s)Zi(s ) = 1, and therefore for each spanning tree T in the network, 

admittance product of the spanning tree T _ 
impedance product of chord sets with respect to T 

[Y l (s)-Y 2 (s)... Y e (s).] 

13-11. Similarly to Eq. (13-20), show that the (/, /) th cofactor of the loop impedance 
matrix Z L (s) of an RLC network G is equal to the sum of the chord impedance 
products for all spanning trees of the network G', obtained from G by deleting 
the ith chord. 

13-12. Attempt an expression for the (/',/')th cofactor of the loop impedance matrix Z l 
of an RLC network. 

13-13. In deriving expressions for An and Al, we tacitly assumed the nonsingularities of 
V^U) and Z L (s). Discuss the requirements imposed on an electrical network 
because of the nonsingularity requirements. [Hint: The network should have 
( 1 ) each voltage source only in series with some passive element, ( 2 ) each current 
source only in parallel with some passive element, and (3) no perfectly coupled 
transformer; i.e., L\Lz > M\ 2 .] 

13-14. Let A/ be the (n — 1) by e reduced incidence matrix of a connected (directed 6 r 
undirected) graph G of n vertices and e edges, with respect to some reference 
vertex r. And let (7,- be the graph obtained from G by fusing its /'th vertex with 
the reference vertex r, and removing any self-loops produced in the process. Prove 
that Athe (n — 2) by e matrix obtained from A/ by deleting its /th row, is the 
reduced incidence matrix (with the fused vertex as the reference vertex) of Gt. 
[Hint: In G, the n — 2 vertices have exactly the same incidences as they had in G. 
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Correspondingly, the n — 2 rows of A f are left intact in AThe edges incident 
between r and / are gone, but the edges that were incident on either r or i but 
not on both have one end incident on the fused vertex.] 

13-15. In Problem 13-14, show that the product of any two corresponding nonzero majors 
of (m — 2) by e unimodular matrices A/_,- and A/_,• is equal to (—l) i+J , providing 
the rows and columns of both these matrices are arranged in the same order. 

13-16. Draw a dual electrical network to the one in Fig. 13-2, and then study the dual 
relationship between various quantities between the two networks, such as the loop 
equations in one being the node equation in the other. 

13-17. For a one-port RLC network, shown in Fig. 13-8(a), show that the driving point 
admittance at terminals (1, r) is 

2 tree admittance product 
2 2-tree (1, r) admittance product 


I A (0 




(a) (b) 

Fig. 13-8 One- and two-port RLC networks. 

13-18. A two-port network has four short-circuit admittance functions. Derive the 
topological formula for each in the two-port RLC network with common reference 
vertex r shown in Fig. 13-8(b). 





GRAPH THEORY IN 
OPERATIONS RESEARCH 


Graph theory is a very natural and powerful tool in combinatorial opera¬ 
tions research. In earlier chapters we have already applied graph theory to 
operations-research problems. The traveling-salesman problem (Chapter 2), 
finding the shortest spanning tree in a weighted graph (Chapter 3), obtaining 
an optimal matching of jobs and men (Chapter 8), and locating the shortest 
path between two vertices in a graph (Chapter 11) are some examples of the 
uses of graph theory in operations research. This chapter will be devoted 
entirely to solving problems in operations research using graph-theoretic 
tools. We shall consider three related areas of operations research in which 
graph theory is used most frequently and profitably. They are transport net¬ 
works, activity networks, and the theory of games. 

14-1. TRANSPORT NETWORKS 

In Section 4-6 we saw how a graph can be used as a model for a network 
of pipelines through which some commodity is transported from one place 
to another. The general problem in such a transport network (also called a 
flow network) is to maximize the flow or minimize the cost of a prescribed 
flow. This is an operations-research problem and can be solved by linear 
programming, but the graph-theoretic approach has been found to be com¬ 
putationally more efficient. In this section we shall see how network-flow 
problems can be formulated and solved using graphs. Let us first define some 
terms. 

Transport Network: A simple, connected, weighted, digraph G is called 
a transport (or flow) network if the weight associated with every directed edge 
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in G is a nonnegative number. In a transport network this number represents 
the capacity of the edge and is designated as c u for the edge directed from 
vertex i to vertex ./. A transport network is shown in Fig. 14-1, where the 
numbers written beside the edge are the edge capacities. 


t 


Fig. 14-1 Transport network. 

The capacity c u of an edge (/,./) can be thought of as the maximal amount 
of some commodity (such as water, gas, electrical energy, number of cars, 
bits of information, etc.) that can be transported from station i to./', along the 
edge (i,j), per unit of time in a steady state. Then a natural question is: What 
is the maximal amount of the commodity flow from a given vertex s to anoth¬ 
er specified vertex t via the entire network? Let us first formulate the question 
mathematically. 

Maximal flow: In a given transport network G, a flow (or a static flow) 
is an assignment of a nonnegative number/.,. to every directed edge (i,j) such 
that the following conditions are satisfied: 

1. For every directed edge (/, j) in G 

Lj < c tr (14-1) 

2. There is a specified vertex 5 in G, called the source , for which 

E/., - E/,. = 04-2) 

i i 

where the summations are taken over all vertices in G. Quantity w is 
called the value of the flow. 

3. There is another specified vertex t in G , called the sink, for which 

E// - E//r = ” H ’- 



(14-3) 
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4. All other vertices are called intermediate vertices. For each inter¬ 
mediate vertex j. 


S/y, - E/,y = 0. (14-4) 

i i 

Condition (14-1) states that the flow through any edge does not exceed its 
capacity. The other three conditions state that the net flow out of the source 
is w, the net flow into the sink is w, and the flow is conserved at each inter¬ 
mediate vertex. This is why w is called the value of the flow from .s to /. Con¬ 
dition (14-3) can, in fact, be derived from (14-2) and (14-4), and is therefore 
not independent. It is understood that if there is no edge from vertex p to q, 
f pq — 0. An edge (i,j) for which f ij — c u is said to be saturated. 

A set of flows fi/s for all (/,/)’s in G is called a flow pattern. A flow pattern 
that maximizes the quantity w is called a maximal flow pattern. The first 
problem one encounters in a transport network is: Given G, s, and /, find a 
maximal flow pattern. 

Linear Programming Formulation: Those familiar with linear program¬ 
ming (LP) will recognize this as an LP problem. As an example, take the 
transport network in Fig. 14-1. The variables are the flows through each of 
the 10 edges. Although w — f sb + f sd —f cs , we can regard w as another 
variable. Let the flow pattern be denoted by a column vector f: 

fsd 

fcs 

fbc 



fcb 

fcs 

fdt 

fdc 

-ftd_ 

and let 


Lf J 

denote the variable vector of the LP problem. Let h denote the row vector 


( 1 , 0 , 0 , 0 , 0 , 0 , 0 , 0 , 0 , 0 , 0 ). 
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Then the problem is to maximize h*f' subject to the constraints 


where 


A'*f' = 0, f<c, 


and f > 0, 


A' = 


0 

0 

0 

1 


1 

-1 

0 

0 

0 


1 -1 
0 0 

0 1 

-1 0 

0 0 


0 

1 

-1 

0 

0 


0 

1 

0 

0 

-1 


0 0 0 0 (T 

-10000 
1 10-10 
00 1 1-1 
0 - 1-1 0 1 


and 


c 


4 

5 
1 

5 
2 

6 
4 
2 
1 
1 


Observe that A' is the incidence matrix of a digraph obtained by adding an 
edge from t to s in the transport network of Fig. 14-1. Also note that the edges 
in f, c, and A' must appear in the same order. 

Clearly, a maximal flow can be obtained by solving this LP problem, but, 
as mentioned earlier, the graph-theoretic approach is more efficient. Using 
the graph-theoretic concept, we shall now state and prove the max-flow min- 
cut theorem, the most important result in the theory of transport networks. 

Cut and Its Capacity: Ignoring the directions of edges in a transport 
network, let us consider a cut-set with respect to vertices ^ and t, that is, a 
cut-set which separates the source 5 from sink /. Such a set of edges in a trans¬ 
port network is called a cut. The notation ( P , P) is used to denote a cut that 
partitions the vertices into two subsets P and P , where P contains 5 and P 
contains t. The capacity of a cut denoted by c(P, P) is defined to be the sum of 
the capacities of those edges directed from the vertices in set P to the vertices 
in P; that is, 

2 c u = c(P, P). 

i€ P 
jeP 
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For example, in Fig. 14-1 the cut (dashed line) separating P = {j, b } from 
P — {c, d, t} has a capacity of 5 -f 5 + 2 = 12. 

Theorem 14-1 

In a given transport network G , the value of flow w> from source 5 to sink t is 
less than or equal to the capacity of any cut separating j from t. 

Proof: Let ( P , P) be an arbitrary cut such that the source s is in vertex set P 
and the sink ? is in vertex set P. Let us write Eq. (14-4) for all intermediate vertices 
in P and add them to Eq. (14-2). This yields 


which can be rewritten as 


2 fpi ~ 2/i,= w > 


p&p 

iec 


pep 

ieG 


But 


Therefore, 


2 fpi + 2 fpi - 2 ftp - 2 fip = w. 

peP peP peP pep 

ieP ieP ieP ieP 


2 fpi - 2 ftp = o. 

pep pep 

ieP ieP 


2 fpi - 2 ftp = w. 


pe Z 

ieP ieP 


(14-5) 


Since 2 ftp is always a nonnegative quantity, we have 


ieP 


w < 2 fpi < 2 c(p, o = c(p, P). 


pep 

ieP 


pep 

ieP 


In the following theorem we shall prove that it is possible to achieve a 
value of the flow which equals the capacity of the smallest cut separating s 
from t. 


Theorem 14-2 (Max-Flow Min-Cut Theorem) 

In a given transport network G, the maximum value of a flow from s to t is 
equal to the minimum value of the capacities of all the cuts in G that separate 5 
from /. 

Proof: In view of Theorem 14-1 we need only to prove that there exists a flow 
pattern in G such that the value of the flow w 0 from 5 to t is equal to c(P 0 , P 0 ), 
the capacity of some cut ( P 0 , P 0 ) separating j from t. 

Let there be some flow pattern in G such that the value of the flow from s to t 
is at its maximum possible value kv Define a vertex set P in G recursively as 
follows: 

(a) s e P. 

(b) If vertex / e P and either f i} < c i} or f jt > 0, then j e P. Any vertex not 
in P belongs to P. 
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Now vertex t cannot be in P. If it were, there would be a path p (see Fig. 14-2) 
from s to t, say, s, v lt v 2 , . . . , v h v j+ u . . . , v k , t, for which in every edge either 
flow < c V) v ux , or fv uy v, > 0. In path p an edge (v Jy v j+l ) directed from v } to 
v j+ ! is called a forward edge and an edge (v j+1 , vj) directed from v J+ ! to v } is called 
a backward edge (Fig. 14-2). 




Fig. 14-2 Path p in the proof of Theorem 14-2. 


In path p let <5 t be the minimum of all differences [c v ,v Ux — fv i v u ] in forward 
edges and d 2 be the minimum of all flows in backward edges. Both <5i and S 2 
are positive quantities. Let <5 = min^j, S 2 ). Then the flow in the network G can 
be increased by increasing the flow in each forward edge and decreasing the flow 
in each backward edge by an amount 8. [Conditions (14-1), (14-2), (14-3), and 
(14-4) are still satisfied.] This contradicts the assumption that vv 0 was the maximum 
flow. 

Thus t must be in the vertex set P. In other words, the cut ( P , P) separates s 
from t. Furthermore, according to condition (b), for each vertex p in P and i in P, 
we have 

f P i = c pi and f {p = 0. 


Therefore, from Eq. (14-5) we get the value of the flow: 

wo = S fpi - 2 ftp 
P e Z p*=Z 

ieP ieP 

= 2 c pi = c{P , P), 

P i%p 

which proves the theorem. ■ 


As an example, let us consider the transport network of Fig. 14-1, once 
again. It has eight (2 3 ) cuts that separate s from t. These cuts (identified by 
vertex set P) and their capacities are 


Vertex Set P 

c(P, P) 

{s} 

9 

{5, b } 

12 

{s, c} 

19 

{ 5 , d] 

7 

[5, b, c] 

11 

{s, b, d] 

10 

(s, c, d] 

16 

{^, b, c, d] 

8 
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The cut with minimum capacity among these is the one in which P = {s, d ) 
and P is {b , c, t}. The maximum flow possible in ^ to t in the network is there¬ 
fore 7 units. 

The proof does not include an algorithm for finding the actual value of 
the maximal flow w max . Nor does it give a flow pattern that realizes this maxi¬ 
mal flow. If we were interested only in finding w max , we would take some 
algorithm for generating a minimal cut (see, for instance, Plisch [14-19] for 
an efficient computer code to generate all minimal cuts in a given transport 
network), and then compute its capacity. 

For those wanting to construct a maximal flow pattern, an algorithm 
based on the foregoing proof of the max-flow min-cut theorem is also avail¬ 
able. This is an efficient algorithm, and it uses a vertex-labeling process for 
constructing a maximal flow pattern. However, the proof that this algorithm 
terminates in a finite number of steps depends on the edge capacities being 
integers. For more on this labeling algorithm and its modifications, see 
[14-7], [14-9], or [14-12]. 

14-2. EXTENSIONS OF MAX-FLOW MIN-CUT 
THEOREM 

The max-flow min-cut theorem as stated is applicable to a transport 
network (simple, weighted, connected digraph) with one source and one sink. 
There are, however, many other types of network-flow problems that can be 
solved by extending the max-flow min-cut theorem appropriately. Some of 
these extensions are straightforward and others are quite involved. Let us 
consider them in increasing order of difficulty. 

1. Multiple Sources and Sinks: If there are several sources s t , s 2 , . . . , s k 
and several sinks f,, t 2 , . ., , t r and if the flow from any source can be sent 
to any sink, then this problem can be converted immediately into a one- 
source and one-sink problem as follows: Introduce a supersource s with 
edges (of unlimited capacity) directed to j,, s 2 , • • •, s k and a supersink t with 
edges (also of unlimited capacity) directed from t { , t 2 , • • • , t r , as shown in 
Fig. 14-3. The problem of maximizing the total value of the flow from all 
sources is then the same as that of maximizing the value of the flow from s 
to t. 

However, if the restriction is made that the flow from a specified source 



Fig. 14-3 Multi-source multi-sink transport network. 
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s t must be sent to a specified sink the problem becomes much more diffi¬ 
cult. Such a flow, known as the multicommodity flow, will be discussed short¬ 
ly as a separate topic. 

2. Vertices with Specified Capacity: Suppose that we have a transport 
network in which some (or all) vertices also have specified capacities. The 
total flow into a vertex v must not exceed its capacity c(v), a real positive num¬ 
ber. This network can be converted into an ordinary transport network by 
replacing each such vertex v with two vertices v' and v" and an edge from v' 
to v" with capacity c{v). All edges originally incident into v are made incident 
into v', and all edges originally incident out of v are made incident out of v", 
as illustrated in Fig. 14-4. 



Fig. 14-4 Replacement of a vertex v with v and v”. 


3. Networks Containing Undirected Edges: Often one encounters the 
problem of maximizing a flow through a network in which some or all of the 
edges are undirected. In such a network an undirected edge between vertices 
p and q of capacity c pq implies that the flow can occur in either direction, and 

f < c 
f < c 

Moreover, since simultaneous flows in opposite directions cancel each other, 
the flow is assumed to be in only one direction. That is, 


f -f = 0 . 

J pq J qp 

Thus the maximum-flow problem in a network containing undirected edges 
can be solved by replacing each undirected edge with a pair of oppositely 
directed edges, each having a capacity of the original edge.t 

fin the case of a multicommodity flow, where two or more commodities flow through 
the network simultaneously, this replacement is not valid. The product f pq 'f q p ^ 0, because 
different commodities can flow in opposite directions without canceling each other. This 
is one of the major difficulties in multicommodity flow problems. 
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4. Lower Bound on Edge Flows: So far we have assumed that the lower 
bound on a flow through an edge in a transport network is zero. Occasionally, 
one encounters a practical situation that requires a minimum flow b u through 
an edge (for instance, an oil pipeline in Alaska may need a specified minimum 
flow to keep it from freezing). That is, conditions 

b t j<fu<c t j (14-6) 

replace (14-1), where b u is a nonnegative real number no larger than c ij . 

For some network there may not even exist a feasible flow pattern, that is, 
one which satisfies the constraints (14-2), (14-3), (14-4), and (14-6). For exam¬ 
ple, all/] 7 = 0 is not a feasible flow pattern, unlike in the case with no lower 
bounds on the edge flows. Therefore, we have to first determine if indeed 
there is a flow pattern in G that satisfies all the upper and lower bounds, and 
if so how do we get a flow. 

It can be shown using the arguments of Theorem 14-1 that the value w 
for any feasible flow pattern must satisfy the following simultaneous require¬ 
ments for every cut ( P , P) in G separating 5 and t : 

w < c(P, P) - b(P, P), 
w > b(P, P) - c(P, P), 

where 

b(P, P) = 2 b tJ and b(P, P) = £ b ir 

iep TeP 

jeP jep 

Furthermore, analogous to Theorem 14-2, it can be shown that if there 
exists a flow pattern satisfying the lower and upper bounds, a maximum flow 
can be achieved, and the value of the flow equals the minimum value of the 
quantity 

c(P, P) - b(P, P) 

taken over all cuts (P, P) separating vertices s and /. Similarly, the minimum 
value of a flow equals the maximum value of 

b(Q, Q) - c(Q, Q) 

taken over all cuts ( Q , Q) separating vertices s and t. 

The problem of determining conditions under which a flow pattern exists 
satisfying constraints (14-2), (14-3), (14-4), and (14-6) is slightly more in¬ 
volved. The reader is referred to Liu [8-3], pages 270-275, or Chapter 2 of 
Ford and Fulkerson [4-3] for further discussions. 

5. Lossy Networks: So far we have assumed that the flow does not vary 
along an edge. In many practical transport networks, however, the flow does 
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suffer loss during transmission, due to leakage, evaporation, and so forth, 
Such networks are called lossy transport networks (or lossy networks). 

A lossy network has an additional parameter, called efficiency, A, 7 , as¬ 
sociated with each directed edge (/, j). For each edge (/, j ) there are two flows: 
flow f ij entering the edge and flow /* leaving the edge. These quantities are 
related as follows: 

f* = Krfij, for each edge 0 \j) in G. 

The efficiency X u is a positive number. It is less than unity if there is a loss 
during transmission and is more than unity of there is a gain (for instance, 
improvement in the signal due to repeaters in a communication line). 

At each intermediate vertex the total outgoing flow must still be equated 
to the total incoming flow. The larger of the two quantities f tJ and f* must 
still not exceed c u , the capacity of the edge (i,j). As in the case of ordinary 
transport networks (in which A (7 = 1, for every edge), the goal is to maximize 
the flow arriving at the sink t. Moreover, for the same value of the flow 
arriving at the sink, we may have different values of flow leaving the source. 
Therefore, another goal is to find a flow pattern that gives the maximum flow 
arriving at the sink for a minimum amount leaving the source. This is called 
an optimal flow in a lossy network. 

The max-flow min-cut theorem has been extended to lossy networks. 
Conditions for optimality have been obtained, and algorithms for optimal 
flows have been devised. For details, see the paper by Onaga [14-18] or pages 
277-288 in [14-7]. 

14-3. MINIMAL-COST FLOWS 

Suppose that associated with each edge (i,j) in a transport network G 
there is an additional number d u , which may be thought of as the cost of unit 
flow through (i,j). It is desired to construct a flow pattern sending a specified 
value w from source s to sink t satisfying constraints (14-1), (14-2), (14-3), 
and (14-4), which minimizes the total flow cost, 

2 d/j'ftp (14-7) 

(<W)£G 

over all flows that send w units from 5 to t. 

This is one of the most practical problems in network flows. It is also a 
classic problem in linear programming and is known as the transportation 
problem. Many problems in operations research can be formulated as a 
transportation problem. 

To find a flow pattern that minimizes the cost, we start with a minimal- 
cost directed path from s to t and saturate this path (i.e., assign a flow to the 
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path such that at least one edge in the path reaches its capacity). Then by us¬ 
ing the following theorem recursively we obtain the minimal-cost flow pat¬ 
tern of desired value. Let us call a path from s to t unsaturated for a given 
flow in G if f u < c iJ for every forward edge (i,j) and/ 7 > 0 for every back¬ 
ward edge (see Fig. 14-2). 

Theorem 14-3 

Let / be the minimal-cost flow pattern of value w from s to /. Then the flow 
pattern f', obtained from / by adding S < 0 to the flow in forward edges of a 
minimal-cost unsaturated path, and subtracting <5 from the flow in the backward 
edges of the path, is a minimal-cost flow of value w + S. 

This theorem is of central importance in constructing minimal-cost flow 
patterns. For a formal proof of this intuitively obvious result, the reader is 
referred to Ford and Fulkerson [4-3], pages 121-122. Theorem 14-3 states 
that at every stage of construction each additional unit of flow is to be sent 
through the least-cost available path. All unsaturated paths from 5 to t are 
available paths, and in computing the cost of an available path p one takes 
into account not only the cost of adding the flow to the forward edges in p 
but also the savings due to reduction of existing flows in the backward edges 
of p. Let us illustrate the application of the theorem with an example. 

In Fig. 14-5 we have a transport network. Of the pair of numbers written 
next to an edge, the first number is the capacity c u and the second one is the 
cost d t j of a unit flow. To find a minimal-cost maximal flow from s to t, we 
go through the following steps. 




>’ 8 
(c) 


Final Flow Pattern 
Fig. 14-5 Minimal-cost flow. 
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1. The minimal-cost path is .sy*/, and total path cost is 4. We can send 
a maximum possible flow of 11 units through this path, thus saturating 
edge ( y , x) in this path. 

2. We modify the network by subtracting 11 from the current capacities 
of all edges in syxt. Set d yx = oo. 

3. In the modified network, the minimal-cost path from s to t is sxt. The 
cost is 5. We sent the maximum possible flow of 3 units through sxt , 
which saturates edge (x, t) in the path. 

4. We further update the network by sending the capacities in the path 
sxt and setting d x( = oo. 

5. In the resulting network the minimal-cost path is syzt of cost 6 and 
capacity 5. Sending 5 units of flow through syzt saturates (5, y). 

6. Appropriate updating yields the network in Fig. 14-5(b). 

7. In Fig. 14-5(b) the minimabcost path is sxyzt with a cost of 4 — 2 + 
3 + 2 = 7. Sending 3 units along this path saturates a cut-set and thus 
the algorithm terminates. The desired flow pattern obtained is given 
in Fig. 14-5(c). The value of the flow from s to t is 11 + 3 + 5 + 3 = 
22 units, and the cost is4x 11+5x3 + 6x5 + 7x3 = 110. 


14-4. MULTICOMMODITY FLOW 

In some practical situations it becomes necessary to deal with several dis¬ 
tinct commodities flowing simultaneously through a given transport network. 
Each commodity has its own source and its own sink. All flows share the 
edge capacity, and therefore, as in the single-commodity case, the sum of all 
flows through an edge must not exceed the capacity of the edge. For each 
commodity the flow is preserved at every intermediate vertex. 

For illustration, let us consider the transport network in Fig. 14-6 through 
which commodities 1 and 2 are flowing. Commodity 1 is to be transported 
from 5 , to t, and commodity 2 from s 2 to t 2 . 

For a two-commodity case, let f} } and/?• be the flows of commodities 1 



Fig. 14-6 Two-commodity transport 
network. 
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and 2, respectively, through an edge (/, j ) in G. Then, analogous to the single¬ 
commodity case, the constraints in Fig. 14-6 are 

fh +fh < c tj 

and 




w, 

for i 

= *i. 

2 

/ (Z fl 

fh = < 

-w, 

for i 

= C > 



0 

for all other vertices, 




for i 

2 ’ 

2 

S (Z 

fh = ' 

-w 2 

for i 

^2» 

J tl/ 


.0 

for all other vertices. 


These constraints can be easily written down for a ^-commodity flow. In 
such flows two problems are usually raised: (1) Construct patterns for all 
^-commodities such that the total sum of the flow values w, + w 2 + • • • + 
w k is maximized; (2) Given the flow values w,, w 2 ,«. . , w k for each com¬ 
modity and a network G, find out if these values of flows can be achieved 
simultaneously. 

Simply sending the maximum amount of each commodity will not in 
general maximize the total value. This can be seen even in the simple case of 
Fig. 14-6. If we maximize alone, we get w x = 15 and w 2 = 0. On the other 
hand, the maximum value of w, + w 2 is obtained with w, = 5 and w 2 = 20. 
Thus to maximize the total value, we must know how to allocate commo¬ 
dities to each edge. 

There is no result similar to the max-flow min-cut theorem for the multi- 
commodity flow in general. Only in some special cases (such as when G is 
undirected and there are only two commodities) has it been possible to get 
a theorem analogous to the max-flow min-cut theorem. 

For further reading in this specialized and rather involved topic, the inter¬ 
ested reader is referred to Chapter 11 of Hu’s book [14-12], Chapter 3 of 
Frank and Frisch [14-7], and the Ph.D. dissertation of Sakarovitch [14-20]. 

14-5. ADDITIONAL APPLICATIONS 

We have been discussing how various types of shipping problems can be 
solved by means of network-flow techniques. In addition to these, there are 
a surprisingly large number of combinatorial problems in operations research 
that can be formulated (and then solved) as network-flow problems. Take 
for instance the matching or assignment problem discussed in Section 8-4. 
We have p men M,, M 2 ,. . . , M p and q jobs7,,7 2 ,. . . ,J g , and it is known 
which men are qualified for which jobs. When is it possible to fill all jobs with 
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qualified men or when is it possible to assign each man a job he is qualified 
for? 

The problem can be formulated as a network-flow problem, as shown in 
Fig. 14-7. Construct a /^-source (/-sink flow network, such that an edge (A/,., 
J k ) exists if and only if man M c is qualified for job J k . Join all sources to a 
supersource s and all sinks to a supersink t. Assign capacities of one unit to 
each (5, Mj) and to each (J t , t ). The capacities of the remaining edges are 
made infinite. Then the optimal assignment problem becomes that of con¬ 
structing a flow pattern with maximum value from s to /. 



Observe that in such a flow pattern 


fsM, 

= 1, 

if z'th man is assigned to a job, 


= 0, 

otherwise, 


= 1, 

if A:th job has been assigned, 


= 0, 

otherwise, 

f 

' M,Jk 

-1, 

if /th man is assigned to the kth job, 


= 0, 

otherwise. 


More complicated personnel assignment problems have been formulated 
in terms of network flow. Numerous other types of problems have also been 
solved as flow problems. For these the reader is referred to the bibliography 
in the survey paper by Fulkerson [14-9]. 

14-6. MORE ON FLOW PROBLEMS 

Flow problems may be looked upon as a generalization of connectivity 
problems, studied in Chapter 4. The study of connectivity involves a search 
for paths between pairs of vertices in a graph. A path from a vertex * to a 
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vertex y implies that some amount of flow can be sent from x to y. To find 
how much, we have to consider the capacities of the edges in the path. 

The maximum number of edge-disjoint paths between a pair of vertices 
jc, y is equal to the minimum number of edges that when removed from the 



1-Commodity 
Multicommodity 



1-source, 1-sink 
Multiple sources and sinks 


Vertex capacities unlimited 
Vertex capacities limited 
Only directed edges 
Mixed edges 

Edge capacities with upper bounds only 
Edge capacities with lower bounds also 
Lossless edges 
Lossy edges 

Flow-cost equal for all edges 
Flow-costs different for different edges 
Capacities deterministically known 
Capacities probabilistically known 
Static flow 
Dynamic flow 

Fig. 14-8 Some types of flow networks. 
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graph leave no path between x and y. This number is precisely the number of 
edges in the smallest cut-set with respect to x and y. This concept, when ap¬ 
plied to a graph with edge capacities, becomes the max-flow min-cut theorem. 
It equates the sum of maximum capacities of paths between x and y to the 
capacity of the minimum cut-set with respect to x and y. 

It is natural to seek results as elegant as the max-flow min-cut theorem for 
more general networks. Some generalizations are easily made. Others, such 
as for the multicommodity flow, have not been possible so far. A summary 
of some common types of flow networks is given in Fig. 14-8. 

It is interesting to compare transport networks with electrical networks, 
studied in Chapter 13. A transport network can be thought of as a special 
type of resistor network that obeys Kirchhoff’s current law (KCL), but not the 
voltage law (KVL). Moreover, the resistors have no resistance for currents 
(i.e., flows) up to a certain value c u and then have an infinite resistance for 
current larger than that. In such a network no voltage (potential, pressure, or 
tension) exists across any branch. 

Conversely, an electrical network problem can also be formulated as a 
flow problem. Consider a resistor network G with current sources in which 
we wish to find currents (i.e., flows) f u flowing through every edge (i,j). The 
upper and lower bounds on the currents are 

C u = oo, 
btj = -oo. 

The flow pattern must satisfy KCL; that is, 

£/„ - I ]fu = 0 04-4) 

i i 

for every vertex j in G. The flow pattern must also satisfy KVL. It was ob¬ 
served by J. C. Maxwell in 1893 that among all flow patterns satisfying (14-4) 
the one that minimizes the power dissipation 

£ Wfh (14-8) 

for all 
( i,j)eG 

is the one that satisfies Kirchhoff’s voltage law also. Quantity r u is the elec¬ 
trical resistance of the edge (i,j). [That minimization of (14-8) is equivalent 
to satisfying KVL, assuming KCL, in a resistive network with current 
sources is left as an exercise.] 

Thus an electrical network problem can be viewed as a flow problem, 
which minimizes a quadratic flow-cost function (14-8) subject to linear con¬ 
straints (14-4). Obviously, then, an electrical network problem (subject to 
KCL and KVL) is not an LP problem. 
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14-7. ACTIVITY NETWORKS IN PROJECT 
PLANNING 

One of the most popular and successful applications of networks in 
operations research is in the planning and scheduling of large complicated 
projects. The two best-known names in this connection are CPM (Critical 
Path Method) and PERT (Program Evaluation and Review Technique). A 
project is divided into many well-defined and nonoverlapping individual 
jobs, called activities. Due to technical restrictions, some jobs must be finished 
before others can be started (such as washing before drying, putting founda¬ 
tion before erecting walls, etc.). In addition to this precedence relationship 
among the activities, each activity also requires a certain time, called the 
duration of the activity. Given the list of activities in a project, the list of 
immediate prerequisites (i.e., predecessors) for each activity, and the dura¬ 
tions, a weighted digraph can be drawn to depict the project, as follows: 
Each edge represents an activity, and its weight represents the duration of the 
activity. The vertices represent beginnings and endings of activities and are 
called events or milestones in the project. An activity (/,/) cannot be started 
before all activities leading to the event / have been completed. Each event in 
the project is a well-defined occurrence in time (such as walls erected, ship¬ 
ment arrived, etc.). Such a weighted, connected digraph representing activi¬ 
ties in a project is called an activity network. 

Let us take an extremely simple example. Suppose that we have a project 
consisting of six activities A, B , C, D, E, and F, with the restriction that A 
must precede C and D ; B and D must precede E; and C must precede F. The 
durations for the activities A, B, C, D, E, and Fare 5, 7, 6, 4, 15, and 2 days, 
respectively. The activity network of this project is shown in Fig. 14-9. 



Fig. 14-9 Activity network. 


Observe that an activity network must be acyclic; otherwise, we would 
have an impossible situation in which no activity in the directed circuit could 
be initiated—a vicious cycle. Also observe that the vertex denoting the start 
of the project must have zero in-degree, since no activity precedes this vertex. 
Likewise, the vertex denoting the termination of the project must have zero 
out-degree, as no activity follows this vertex. 
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Dummy Activity: In the example of the activity network considered in 
Fig. 14-9, suppose we had an additional restriction that activity F could not 
be started before B and D were completed. We can incorporate this prece¬ 
dence relationship by drawing an edge from vertex x to^ (Fig. 14-10). Such an 

Dummy activity 



Fig. 14-10 Dummy activity in a network. 


edge, which represents only a precedence relationship and not any job in the 
project, is called a dummy activity. Dummy activities become necessary when 
the existing activities are not enough to portray all precedence relationships 
accurately. All dummy activities are of zero duration and are usually shown 
in broken lines. 

Two parallel edges (i.e., activities having the same immediate predecessor 
and the same immediate successor) may be replaced by a single edge, com¬ 
bining both activities into one [Fig. 14-11(a)]. If, however, the activities are 
to be kept track of separately, then a dummy activity and a dummy event 


max [t x , t y 1 

- * -• 

x and y 


(a) 




Fig. 14-11 Replacement of parallel edges. 
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must be created [Fig. 14-11(b)]. And, as there can be no self-loop in an activity 
network, we have only simple digraphs for an activity network. 

An activity network can be assumed to have exactly one vertex with zero 
in-degree and exactly one vertex with zero out-degree. If there is more than 
one vertex having zero in-degree, one arbitrarily selects one of these for the 
start event and draws dummy activities from this to the other vertices. The 
vertices with zero out-degrees are handled similarly. 

In brief, an activity network is a representation of two aspects of a project: 
(1) precedence relationships among the activities, and (2) their durations. It 
is a connected, weighted, simple, acyclic digraph with exactly one vertex of 
zero in-degree and exactly one vertex of zero out-degree. 


14-8. ANALYSIS OF AN ACTIVITY NETWORK 

A newly constructed network should first be checked for any directed 
circuit. A directed circuit implies inconsistency in the network, which must be 
corrected. Although Theorem 9-17 gives an algorithm for finding whether or 
not a digraph has a directed circuit, a more efficient method is provided by 
topological sorting of vertices, defined as follows: 

Topological Sorting: The vertices of a digraph G are said to be in topologi¬ 
cal order if they are labeled 1,2,3,...,« such that every edge in G leads 
from a smaller numbered vertex to a larger one. That is, for every edge (/, j) 
in G we have / < j. The process of relabeling the vertices such that they are in 
a topological order is called topological sorting. Clearly, if a digraph contains 
a directed circuit, it is not possible to put its vertices in a topological order. 
The following construction procedure shows that the vertices of every acyclic 
digraph G can be put in a topological order. 

Start with a vertex with zero in-degree and label it 1. Delete vertex 1 from 
G, and in the remaining digraph (G — 1) find a vertex with zero in-degree and 
label it 2. From ((7—1) delete vertex 2 and repeat the process, till either (1) 
every vertex is labeled, or (2) we find a subdigraph g in which there is no ver¬ 
tex with zero in-degree. In view of Theorem 9-15, case (2) is possible only if 
g contains a directed circuit. Thus we can state 

Theorem 14-4 

The vertices in a digraph can be arranged in a topological order if and only if 
the digraph is acyclic. 

Topological sorting performs two functions in an activity network: (1) 
it detects directed circuits, if any, in the network, and (2) it puts the events in 
a topological order 1, 2, 3,...,«, where 1 is the start event and n is the com¬ 
pletion event of the project. 
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Topological sorting is an important process in many problems besides 
activity network analysis. For example, if we want to arrange the words in 
a glossary so that no term is used before it has been defined, we resort to a 
topological sorting. We shall therefore present an algorithm for this impor¬ 
tant process in a step-by-step fashion. 

Algorithm for Topological Sorting 

1. Set i <— 1. 

2. Find an unlabeled vertex with zero in-degree, and label this vertex i. 
If no such vertex exists, go to step 4. 

3. Set i <— i + 1; and go to step 2. 

4. If every vertex in G has been labeled, stop. Otherwise, go to step 5. 

5. If the out-degree of any vertex labeled so far is nonzero, remove all 
edges incident out of every labeled vertex and go to 2. If there are some 
unlabeled vertices and the out-degree of each of the labeled vertices is 
zero, we have a directed circuit in the network, stop. 

Note that there may be more than one topological ordering of the vertices 
in a given acyclic digraph, because at step 2 in the algorithm it is possible to 
have more than one vertex with zero in-degree. 

Critical Path: Having made sure that the activity network G contains no 
directed circuits and (in the process) having placed the vertices of G in a 
topological order 1,2our next task is to determine the project dura¬ 
tion. The minimum time required to complete the entire project is equal to 
the length (i.e., sum of the activity durations) of the longest directed path in 
G. (The longest directed path is, of course, from 1 to n .) The longest directed 
path is called a critical path (CP). The vertices and edges in a CP are called 
the critical events and the critical activities, because any delay in them will 
delay the entire project. In Fig. 14-9 the critical path is ADE and the project 
duration is 5 + 4 + 15 = 24 days. There may be more than one critical path 
in a given activity network. 

Instead of determining the longest path only from 1 to n, let us determine 
the longest paths frorp vertex 1 to every vertex k in G, where k = 2, 3,, . . , n. 
The length of the longest path from 1 to A: is called the earliest event time for 
event k, because this is the earliest possible time at which event k can be 
realized. 

Since digraph G is acyclic, the method of obtaining shortest paths from a 
specified vertex to all others, given in Chapter 11, can be easily modified for 
finding the longest paths. In fact, since the vertices are already topologically 
ordered, the task is even simpler. Let 


tu = duration of activity (i,j) in G, 
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and let 

T(k) = length (i.e., time) of longest path 

from 1 to k, for k = 1, 2, 3, . . * , n. 

Clearly, T{\) — 0. Vertex 2 can be reached only from vertex 1 (because of the 
topological order), and therefore 

T(2) = 7(1) + t l2 = t xl . 

Vertex 3 cannot be reached from any vertex except from 1 and 2. Therefore, 

T{ 3) = max[r(l) +t x3 , T( 2) + t 23 \. 

Similarly, vertex 4 can possibly be reached only from 1, 2, and 3. Therefore, 
T( 4) = max[T(l) + t l4 , T(2) + t 24 , T(3) + t 3A \. 

And so on. The general expression can thus be written as 

T(k) = max[r(/) + t ik l (14-9) 

i<k 

where the maximum is over all vertices i from which there is a directed edge 
(/, k) to vertex k. 

The solutions of these equations can be performed one by one, and T(l), 
T( 2), . . . , T{ri) obtained successively. Let us take a simple example: 



Fig. 14-12 Activity network. 


An activity network consisting of 8 events, 12 activities, and 1 dummy 
activity is shown in Fig. 14-12. The event labels are shown inside the small 
circles representing the vertices. They are in topological order 1, 2, 3,. . . , 8. 
(Since the vertices are identified and the digraph is simple, the activity labels 
have been dispensed with.) The durations of activities (in some unit of time) 
are shown next to the edges. Let us compute the earliest event times T(i) for 
/ = 1, 2, 3, . . ., 8. 
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With successive application of Eq. (14-9), we get 

n4) = max[f 14 , r(2) + t i4 ] 

= max[7, 9] = 9, 

T(5) = max[r(2) + t 25 , T(3) + t 35 , T(4) + r 45 ] 

= max[10, 16, 12] = 16. 

Similarly, 

7X6) = 21, r(0 =16, 7X8) = 28. 

Thus the project duration T(8) = 28. The critical path is 1,3, 5, 6, 8, and is 
shown in heavy lines in Fig. 14-12. 

This computation of earliest times of topological ordered events by trac¬ 
ing the longest paths from vertex 1 to vertices 2, 3, . . . , n, successively, is 
referred to as forward calculation. 

Latest Event Time: To ensure that the project is finished at time T{n), we 
have to make sure that none of the critical activities is delayed. There is, 
however, a certain amount of latitude in scheduling noncritical activities. A 
noncritical activity may be allowed to slip (and thereby save money or nerves) 
to a certain extent without delaying the project completion. The latest time 
by which an event k must be realized without increasing the project duration 
is called the latest event time T'{k). For example, in Fig. 14-12 event 7 may be 
realized latest by time unit 22 (and no later) without affecting the completion 
time of the project. It is not difficult to see that the latest event time is given 
by the relation 

T(k) = T(n) — time taken along longest path from vertex k to n. 

If we reverse the direction of every edge in the network, the vertices will 
still be topologically sorted, but the order would be reversed, n, n — 1, . . . , 
2, 1. Starting from vertex n, one could move toward vertex 1 and compute 
the times taken along the longest paths, using Eq. (14-9) successively in the 
reversed network. Starting with relation 

T'(n) = T(n), 

we get the following recursive relation for the latest event time for vertex k: 

T\k) = min[r(/) - 1 J, 


(14-10) 
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where the minimization is over vertices i to which there is directed edge ( k, i) 
from vertex k. For example, in Fig. 14-12, 

T'(7) = 28 - 6 = 22, 

T'(6) = 28 - 7 = 21, 

T\ 5) - min[T'(7) - 0, T\ 6) - 5] - 16, 

and so on. Table 14-1 shows all earliest and latest event times in the activity 
network of Fig. 14-12. Note that T(i) = T'(i ) if and only if vertex / is in a 
critical path. 


Event / 

no 

TV) 

1 

0 

0 

2 

8 

12 

3 

10 

10 

4 

9 

13 

5 

16 

16 

6 

21 

21 

7 

16 

22 

8 

28 

28 


Table 14-1 Earliest and Latest Event Times 


Slacks: As a measure of maximum latitude available in a noncritical 
activity, let us look at the following quantity called total slack (or float) of 
activity (ij). 

S U = T’U) - m - t tJ . (14-11) 

Quantity s u represents the maximum permissible delay in activity (/, /), which 
is possible when i is realized as early as possible, and j is delayed as much as 
possible. 

Since each activity has two end vertices, each one of which has two time 
values, it is possible to define four different slacks for each activity. We have 
considered only the most important one, the total slack. The second most 
important slack is the free slack, v u , defined as 

v tJ = T(j) - T(i) - t tJ . (14-12) 

This is the amount by which an activity (/,y)can be delayed without delaying 
the early start of any other activity. Total slacks and free slacks for all activ¬ 
ities in the network of Fig. 14-12 are shown in Table 14-2. 

Observe that the total slack s u = 0 if and only if (i,j) is a critical activity, 
whereas v ij may be zero even if (/, j) is not a critical activity. Also note that 
s tJ > v tj > 0. 
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Activity 

Total Slack 

Free Slack 

0,2) 

4 

0 

0,3) 

0 

0 

0,4) 

6 

2 

(2, 4) 

4 

0 

(2, 5) 

6 

6 

(2, 6) 

4 

4 

(3,5) 

0 

0 

(3, 7) 

8 

2 

(4, 5) 

4 

4 

(5, 6) 

0 

0 

(5,7) 

6 

0 

(6, 8) 

0 

0 

(7, 8) 

6 

6 


Table 14-2 Total and Free Slacks of Activities in 
Network of Fig. 14-12 


In the foregoing analysis of a network, called the critical path method 
(CPM), we have accomplished the following: 

1. Checked for directed circuits. 

2. Arranged events in topological order. 

3. Identified critical path (or paths) and computed the project duration. 

4. Computed earliest event time T(k ) for each event. 

5. Computed latest event time T'(k ) for each event. 

6. Computed slacks for each activity. 

Having identified the critical activities, we can concentrate only on these 
and by expediting them reduce the total project duration. Second, the slacks 
can be utilized to reduce the peak demands for certain machines or skilled 
workers. 

Project Cost Curve: Although we have assumed a constant duration for 
each activity, in practice allocation of more money can usually get a job done 
faster. Given a fixed budget for the project, how should the money be allocat¬ 
ed among the activities so that the project is completed at the earliest possible 
date? If for each activity the time-cost relation is linear, this problem can be 
shown to be a minimal-cost flow problem (see pages 151-162 of [4-3] or 
[14-8]). The solution of the problem will be a curve showing project cost 
versus project duration, and, depending on the budget (or the target com¬ 
pletion date), one would pick a point on this curve. Such a curve is known as 
the project cost curve. 

Often in activity networks, in addition to time and cost, there may be 
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other parameters, such as personnel required, shop facilities necessary, and 
so forth, associated with each edge. 

In CPM networks activity durations were assumed to be precisely known. 
If the activity durations are random variables with given probability dis¬ 
tributions, the network goes by the acronym PERT (Program Evaluation and 
Review Technique). Whereas CPM focuses on optimizing the total project 
cost, PERT is more concerned with estimates of completion dates, scheduling 
requirements, and so forth. Activity networks with variations of these two 
are also encountered. 


14-9. FURTHER COMMENTS ON ACTIVITY 
NETWORKS 

In this chapter vertices were used to represent events and the edges to 
represent activities. There is another representation often used in the litera¬ 
ture in which the vertices denote activities and the edges represent only the 
precedence relationships among the activities. Obviously, for a given set of 
activities these two will yield different graphs. For example, the project 
activities of Fig. 14-9 are shown in both representations in Fig. 14-13. 



(a) Event-Vertex Representation (b) Activity-Vertex Representation 

Fig. 14-13 Two representations for the same activities. 


It is not difficult to transform the event-vertex representation into the 
activity-vertex representation and vice versa (see [14-5]). In fact, the activity- 
vertex representation is the edge digraph (see Problem 9-16) of the event- 
vertex representation, if we disregard the dummy activities in the latter. There 
is no need of dummy activities in the activity-vertex representation. Both 
types of representations are widely used in the literature. Each has its own 
slight advantage over the other, but there is no basic difference between the 
two as far as their analyses are concerned. 

The activity network and its application in project planning have been in 
existence only since 1957. In these few years its success has been spectacular. 



sec. 14-10 


GRAPHS IN GAME THEORY 409 


Computer programs for analyzing CPM and PERT networks are part of the 
standard program library of almost every computing center. Large networks 
consisting of thousands of activities are often analyzed. 

Generally, the network is constructed from a list of activities and a pre¬ 
cedence table. Generation of a precedence table is a manual job because it 
involves an intimate knowledge of the processes in the project. The construc¬ 
tion of the activity network, including the dummy activities, from the pre¬ 
cedence table can be relegated to the computer, although it is still done mostly 
manually. Construction of a composite network from subnetworks can also 
be programmed [14-22]. 

A typical computer program for critical-path analysis consists of three 
phases: (1) cycle-checking and topological-sorting phase, (2) forward-time- 
calculation phase, and (3) backward-time-calculation phase. Shortcuts have 
been suggested that can complete the critical-path analysis in a single phase, 
and save computation time in the case of large networks [14-17]. 

We have presented the bare essentials of the activity network analysis. 
Much more can be done with graph theory in project planning. 


14-10. GRAPHS IN GAME THEORY 

The theory of games has become an important field of mathematical 
research since the publication of the first book on the subject by John von 
Neumann and Oskar Morgenstern in 1944. Game theory is applied to prob¬ 
lems in engineering, economics, and war science to find the optimal way of 
performing certain tasks in a competitive environment. 

The general idea of game theory is the same as the one we associate with 
parlor games such as chess, bridge, and checkers. The distinction between 
a puzzle and a game is that in a game one plays against one or more human 
opponents, whereas a puzzle involves a solitary effort to solve a problem. 

A game may be played between two persons, such as chess, or among 
more than two persons, such as poker. The former is called a two-person 
game and the latter an n-person game. Another classification of games is 
based on whether or not an element of randomness is introduced, such as by 
dice or cards. A third element in categorizing a game is whether or not a play¬ 
er has complete information on the position of a game at every move. A game 
such as chess, in which each player knows exactly where the game stands is 
called a perfect-information game. Bridge, in which one does not know what 
cards the other players have, is an imperfect-information game. A game is 
called finite if each player has a finite number of choices available at each 
move and the game must end after a finite number of moves. An infinite game 
is one in which a player chooses a move from an infinite set of moves. 

We shall confine ourselves to the study of two-person, perfect-informa- 
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tion, finite games without chance moves. A digraph is a natural representa¬ 
tion of such a game. The vertices represent the positions (also called states ) in 
the game and the edges represent the moves. There is a directed edge from 
vertex v. to v s if and only if the game can be transformed from position (state) 
v. to Vj by a move permissible under the rules of the game. As an example, 
let us look at a very simple game. It is a simplified version of a game called 
nim. 

Simplified Nim: Two piles of sticks are given and players A and B take 
turns, each taking any number of sticks from any one pile. The player who 
takes the last stick wins, and since the finite quantity of sticks will eventually 
be exhusted, it is obvious that the game allows no draw. As a further simplifi¬ 
cation, let us start with two piles containing two sticks each. The complete 
game is described by the digraph in Fig. 14-14. Each state of the game is 
described by an ordered pair of labels (*, >>), indicating the number of sticks 
in the first and the second pile, respectively. 



( 0 , 0 ) 


Let us observe some properties of such a game digraph (a digraph repre¬ 
senting a two-person, perfect-information, finite game without chance 
moves): 

1. The digraph has a unique vertex with a zero in-degree. This vertex 
represents the starting position in the game and is therefore called the 
starting vertex. Vertex (2, 2) in Fig. 14-14 is the starting vertex. 

2. There are one or more vertices with zero out-degree. These correspond 
to the closing positions in the game, and are called the closing vertices. 
Vertex (0, 0) is the closing vertex in Fig. 14-14. 

3. A game digraph is a connected, acyclic digraph. A directed circuit 
would imply that the game could go on indefinitely. (In practice, in 
a game such as chess, where the game may return to a state, endless 
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matches are prevented by means of a rule that after a certain numbei 
of repetitions of the same move, the game is declared stalemated.) 

4. Each directed path from the starting vertex to a closing vertex repre¬ 
sents one complete play of the game. This path consists of edges re¬ 
presenting the moves of the two players alternately. 

The most important question in a game is the following: When and how 
can a player choose his moves so that he is certain of winning? We shall first 
answer this question for the specific game in Fig. 14-14, and then generalize 
it. 

Let us call a position “won” if the player who brought the game to this 
position can force a victory. Conversely, a position is dubbed “lost” if the 
player who brought the game to this position can be forced to lose. In keep¬ 
ing with this characterization of vertices, the closing vertex in Fig. 14-14 is 
to be marked as won, because the player who brought the game to this posi¬ 
tion is the winner. Having marked this vertex as won, let us use the following 
procedure to mark the remaining vertices as won or lost. 

Mark an unmarked vertex won if all its successors are marked lost, and 
mark an unmarked vertex lost if at least one of its successors is marked won. 
(This is because it is assumed that each player is intelligent and makes the 
best possible move at each stage.) This results in vertices (0, 0), (1, 1), and 
(2, 2) being marked as won and the remaining as lost. And thus the player 
who makes the second move has the winning strategy, since he can force his 
opponent to move to the vertices marked as lost. 

To generalize the foregoing method of finding a winning strategy, let us 
introduce the concept of kernel in a digraph. 

Kernel of a Digraph: A set of vertices A" in a digraph G is called a kernel 
(or nucleus) of G if 

1. No two vertices in K are joined by an edge. 

2. Every vertex v not in K has an edge directed from v to some vertex in 
K. 

Conditions 1 and 2 correspond respectively to definitions of an independent 
set and a dominating set in an undirected graph (recall Chapter 8). What are 
some types of digraphs that have kernels? Theorem 14-5 characterizes one 
such type. 

Theorem 14-5 

Every acyclic digraph has a unique kernel. 

Proof: The theorem will be proved by a constructive procedure, at the end of 
which all vertices forming the kernel will be painted red. Let G be the given acyclic 
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digraph. According to Theorem 9-15, G must have at least one vertex with zero 
out-degree. Let V x be the set of all vertices in G with zero out-degree. Since these 
vertices must all be in the kernel of G, paint them red. 

Next, let W x be the set of all those vertices in G from which there is at least 
one directed edge to some vertex in V x . Clearly, no vertex in W x can be included 
in the kernel. Delete from G all vertices in W x (together with the edges incident 
on them, of course), and thus obtain subgraph ( G — IT,). 

Subgraph (G — W x ) is also acyclic. Let V 2 be the set of all vertices with zero 
out-degree in (G — W x ). Since in the original digraph G no vertex in set V 2 had a 
zero out-degree, every vertex in V 2 had to have at least one edge going to some 
vertex in W x . Moreover, in digraph G no vertex in V 2 could have been adjacent 
to any vertex in V x . Nor could any vertex in V 2 have been adjacent to any other 
vertex in V 2 , because the out-degree of each vertex in set V 2 of (G — W x ) is zero. 

Thus we conclude that every vertex in V 2 must also be included in the kernel 
of G and therefore be painted red. 

This procedure is continued till every vertex in G is either deleted or painted red. 
The unique set of vertices painted red constitutes the kernel of the digraph. ■ 

As an illustration, let us find the kernel in the acyclic digraph in Fig. 14-14. 
It is easily seen that the set of three vertices marked (0, 0), (1, 1), and (2,2) is 
the kernel. 

Let A be the player who makes the first move in the game and B be the 
player who makes the second move. Assuming that the rule of the game is 
such that the player who is able to make the last possible move in the game is 
always the winner, we have the following important result. 

Theorem 14-6 

In the game digraph if the starting vertex is not in the kernel K, then player A 
is assured of a win, and A can win by always selecting vertices in K. 

Proof: Since the starting vertex is not in set K, player A can move the game 
to a vertex x in K. If this vertex x is a closing vertex, A is the winner. If not, the 
second player B will have to move to some vertex y, which is not in the kernel K. 
In his next move A can take the game to some vertex in K. The game continues, 
with B forced to take it out of K and A bringing it back into K. Eventually, the 
play will be brought to a closing vertex by A, because all closing vertices are in K. 
Thus A wins the game. ■ 

Corollary 

It follows from the proof of this theorem that if the starting vertex is in the 
kernel, the second player B has the winning strategy; and B can win by always 
selecting vertices in the kernel. 

In the foregoing analysis the rules of the game were assumed to be such 
that the player who made the last possible move was always the winner. In 
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many games (such as chess or tic-tac-toe) some of the closing vertices rep¬ 
resent a draw and others a win. In such a game choosing vertices from the 
kernel will only assure a player a win or a draw. 

There are also games in which the rule is such that the player who is forced 
to make the last move is the loser rather than the winner. From such a game 
digraph, if we remove all edges corresponding to the last moves, the game can 
be converted to the type in which the winner is the player making the last 
move. In other words, the game is decided at the time the second-to-the-last 
moves are made. For example, let us modify the nim game of Fig. 14-14 such 
that the player forced to take the last stick is the loser. Then erasing the last 
moves from the digraph, we get Fig. 14-15. The three vertices marked won 
constitute the kernel in this digraph. Even in this game player B has the win¬ 
ning strategy. 



Thus, at least in theory, it is possible to construct the game digraph for 
any two-person, perfect-information, finite game with no chance moves, and, 
since the digraph is acyclic, it is also possible to obtain its kernel. Therefore, 
if each player plays according to his best strategy, the outcome is predeter¬ 
mined. It will be either a draw or a certain win for the player who makes the 
first move if the starting vertex is not in the kernel, or for the player who 
makes the second move if the starting vertex is in the kernel. In this sense, 
every game of this type is either “unfair” or “futile.” 

In reality the situation is not so bleak as it appears. In most nontrivial 
games, such as checkers or chess, the number of positions (i.e., the vertices 
in the game digraph) is so enormous that the game digraph cannot even be 
stored in the memory unit of any existing or contemplated computer. 



414 


GRAPH THEORY IN OPERATIONS RESEARCH 


CHAP. 14 


This is precisely why in real problems in operations research the theory of 
games provides an approach rather than a complete analysis. Moreover, 
graph theory is applicable only to a very special but important class of games. 

SUMMARY 

We have considered three important classes of problems in combinatorial 
operations research: transportation problems, activity networks, and game 
theory. These problems can be expressed and solved elegantly as graph- 
theory problems involving connected and weighted (mostly acyclic) digraphs. 
From a practical point of view, all these problems are trivial (and so is any 
combinatorial problem) if the network is small. Many real-life situations, 
however, consist of huge networks, and therefore it is important to look at 
these network problems in terms of solving them on computers. 
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1 C SURVEY OF OTHER 
13 APPLICATIONS 


In the last three chapters we have explored in considerable detail the 
application of graph theory to three disciplines, switching and coding theory, 
electrical networks, and operations research. In this final chapter we shall 
briefly describe how graph theory is used in a number of other areas. The 
first three sections are somewhat related. They all deal with representation of 
a system structure by means of a weighted, connected digraph and subsequent 
analysis of the system through an appropriate study of the digraph. In Section 
15-1 a linear system is modeled as a weighted digraph, which has proved to be 
a convenient tool for analysis. Section 15-2 deals with representation of a 
stochastic process (a discrete Markov process) by a digraph and makes use of 
Section 15-1 for its analysis. Section 15-3 uses weighted digraphs for the analy¬ 
sis of computer programs. A discrete Markov process is an appropriate model 
for many programs, and thus Section 15-2 is made use of in Section 15-3. 

Section 15-4, in which graph theory is used as a tool for identification of 
chemical compounds, is an isolated section. It is, however, an important 
application of graph theory. Finally, Section 15-5 lists some miscellaneous 
applications, with relevant references. 


15-1. SIGNAL-FLOW GRAPHS 

Most problems in analysis of a linear system are eventually reduced to 
solving a set of simultaneous, linear algebraic equations. This problem, 
usually solved by matrix methods, can also be solved via graph theory. The 
graph-theoretic approach is often faster, and, more importantly, it displays 
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cause-effect relationships between the variables—something totally obscured 
in the matrix approach. This graph-theoretic analysis of a linear system 
consists of two parts: (1) constructing a labeled, weighted digraph called the 
signal-flow graph, and (2) solving for the required dependent variable from the 
signal-flow graph. 

In a signal-flow graph each vertex represents a variable and is labeled so. 
A directed edge from x t to x y implies that variable Xj depends on variable x ( . 
(but not the reverse). The coefficients in the equations are assigned as the 
weights of the edges such that the variable x k is equal to the sum of all 
products w ik x., where w ik is the weight of the edge coming into x k from x r As 
an example, let us construct a signal-flow graph for the system given by the 
set of three equations, 


C ll X i “1“ C 12 X 2 C 13 X 3 y 1» 

^2l X i ^22 X 2 "f” C23 X 3 f 2’ (15-1) 

C 3l X l “b C 32 X 2 “I - - C 33 X 3 = .V3’ 

which can be rewritten as 

(c,! + 1)*, + C 12 x 2 + c 13 x 3 —y x = x„ 

C 2l X l “1“ ( C 22 “b 1)^2 ~b C 23 X 3 ^2 = X 2-> (15“2) 

^*3 1-^1 T“ ^32'^2 ~b (^*33 ~b 1)^3 y 3 -^ 3 * 

The signal-flow graph representing Eqs. (15-2) is given in Fig. 15-1. 

Clearly, the in-degree of a vertex v in a signal-flow graph is zero if and 



Fig. 15-1 Signal-flow graph for Eqs. (15-2). 



418 


SURVEY OF OTHER APPLICATIONS 


CHAP. 15 


only if v represents an independent variable. Also note that a signal-flow 
graph is connected; otherwise, we have two uncoupled (unrelated) systems 
thrown together. 

A signal-flow graph can be compared to a signal transmission network, in 
which the vertices corresponding to the independent variables are signal 
sources, and the other vertices are repeaters, which act as receiving, summing, 
and transmitting devices. The signals travel along the edges and are multiplied 
(amplified or attenuated) by the weights of the edges traversed. The label jc. 
of a vertex equals the sum of all incoming signals, and is the strength of the 
signal in each outgoing edge from x r It is from this analogy that the name 
“signal-flow graph” comes. For the same reason the edge weights are called 
edge gains, and independent variable vertices are referred to as source vertices. 

Note that a signal-flow graph contains the same information as the 
equations from which it is derived; but there does not exist a one-to-one 
correspondence between the system of equations and the digraph. From the 
same set of n equations we can obtain n \ different signal-flow graphs (some of 
which may be isomorphic), depending on the order in which the variables 
*/s are written on the right-hand side, say, in Eqs. (15-2). 

Now, given a set of algebraic equations 

Cx = y, (15-3) 

how do we obtain the weight matrix of the signal-flow graph without first 
having to draw the digraph? (Like any weighted digraph, the signal-flow 
graph is completely described by its weight matrix.) 


Theorem 15-1 


The weight matrix W 
(15-3) is given by 


= [Wij\ of the signal-flow graph corresponding to Eqs. 


W = 


C + l -\~] T 


(15-4) 


where I is the identity matrix of the same order as C, and the superscript T denotes 
the transposed matrix. 


The theorem is not difficult to prove and is left as an exercise. Note that 
the columns of all zeros in W correspond to the y vertices (i.e., independent 
variables). 

Although signal-flow graphs can always be constructed from a set of 
equations, in many physical problems, particularly in electrical systems, 
signal-flow graphs are drawn directly without first writing the equations. 
Usually, a signal-flow graph can be drawn as easily as the equations are 
formulated. Also, writing equations from a signal-flow graph is a simple 
matter, because each vertex x k represents one equation of the system in which 
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/ 



x k is equal to the sum of the products of weights of all incoming edges and the 
labels of the initial vertices of these edges. For example, the system of equa¬ 
tions for the signal-flow graph of Fig. 15-2 can be immediately written down 
as 

x t =y i + ax 2 + dx 3 , 
x 2 = bx x +fx 2 , 
x 3 = cx x + ex 2 + gx 4 , 
x 4 = hx 3 +jx 5 + y 4 , 
x 5 = ix 4 . 


These can be rewritten in the same form as Eq. (15-3), where 



C = 


—c 


0 

0 


—a ~d 0 0 

1 -/ 0 0 0 

— e 1 — g 0 > 

0 —h 1 —j 

0 0-/1 




>r 

*2 


0 

*3 

, and y = 

0 

*4 


y 4 

-*5- 


_o_ 


(15-5) 


Reduction of Signal-Flow Graphs 

The signal-flow graph method of analysis is most useful when we want to 
solve for only one unknown variable, say x J} as a function of one independent 
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variable, say y k . We solve by eliminating all other vertices one by one, taking 
care that this elimination process does not alter the net product of the edge 
weights of directed paths from y k to x r This graph reduction corresponds 
exactly to the algebraic method of eliminating all other variables by systematic 
substitution. Some elementary reductions of a signal-flow graph are shown 



b 

■*- 






acd 



b 


Fig. 15-3 Reductions of signal-flow graphs. 
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in Fig. 15-3. Repeated application of such reduction steps, selected visually, 
will eventually lead to elimination of all intermediate vertices. (Apply 
these reduction steps successively to Fig. 15-2 to eliminate all vertices except y , 
and x 3 .) 

Although our ability to reduce the digraph by simple inspection adds 
much to the power and flexibility of signal-flow graphs, it is often better to 
use a more methodical technique that does not depend on visual inspection. 
And such a method is provided by Mason's gain formula. 

Mason's Gain Formula: Let p be a directed path from a vertex a to b in a 
signal-flow graph G. Then the product of the weights of all edges in this path 
is called the path gain of p (same as path product defined in Section 12-2 for 
undirected graphs). Similarly, the product of the weights of all edges in a 
directed circuit (or cycle) T is called the cycle gain of T. For example, a list of 
all directed circuits and their gains in Fig. 15-2 is 


Directed Circuit 

Cycle Gain 

X2X2 (self-loop at X2) 

/ 

X X X2Xj 

ab 

X1X3X1 

cd 

X3X4X3 

gh 

X4X5X4 

ij 

X1X2X3X1 

bed 


Furthermore, for a given signal-flow graph G let us define the following: 
/, = sum of cycle gains of all directed circuits 

t 2 — sum of products of cycle gains of all vertex-disjoint directed circuits 
taken two at a time 

t 3 = sum of products of cycle gains of all vertex-disjoint directed circuits 
taken three at a time 

t k — sum of products of cycle gains of all vertex-disjoint directed circuits 
taken k at a time. 

Thus, for the signal-flow graph of Fig. 15-2 we have 

t x =/+ ab + cd + gh + ij + bed, 

t%= fed + fgh +fij + abgh + abij + edij + bedij, 

1 3 = fedij , 


and 


U = t 5 = • • • = 0, 
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as the maximum number of vertex-disjoint directed circuits in Fig. 15-2 is 
only three. 

Now we are ready to state and illustrate two theorems, which together 
constitute the most important result concerning signal-flow graphs. 

Theorem 15-2 

Let a signal-flow graph G characterize a set of equations Cx = y; then A, the 
determinant of matrix C, is given by 

A = 1 — t\ + tz — t-$ + • • • + ( — l) 9 / g , (15-6) 

where q is the maximum number of vertex-disjoint directed circuits in G. 

Theorem 15-3 

Let a signal-flow graph G characterize a set of equations Cx = y; then the ijth 
cofactor of C, C, 7 , is given by 

C i] =ZP k A*, (15-7) 

k 

where P k is the path gain of the &th directed path from vertex / to j, A k is the value 
of A in Eq. (15-6) for that part of the digraph having no vertices in common with 
the &th directed path, and the summation is over all directed paths from / to j. 

Combining Theorems 15-2 and 15-3, we get Mason’s gain formula, which 
gives the response Xj due to the forcing function y t as 

x j — Cu f 

all other y's= 0 A 

where C u and A are computed from the signal-flow graph G , using Eqs. 
(15-6) and (15-7). 

Besides the original proofs given by S. J. Mason, many different proofs of 
Theorems 15-2 and 15-3 have been published. All are involved—some more 
than others. A particularly elegant proof, due to R. Ash, is given in [13-5], 
pages 102-109. We shall simply illustrate the application of Mason’s gain 
formula by means of an example. 

For Fig. 15-2, 

A = 1 — t 1 T - 1 2 — t 3 

= 1 — / — ab — cd — gh — ij — bed + fed + fgh 
+ fij 4- abgh + abij + edij + bedij — fedij. 

That this indeed is the determinant of the corresponding matrix C as given 
in Eq. (15-5) can be easily verified by direct computation. 
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There are two directed paths from y l to x 3 : 

p x = with path weight p x = c 

and p 2 = y x x x x 2 x 3 with path weight p 2 = be. 

Correspondingly, 

A, = 1 -f-ij +fij 
and A 2 — 1 — U- 

Therefore, according to Theorem (15-3), the (1, 3)th cofactor is 

C 13 — A X P X + A 2 P 2 

= (1 -f-ij +fij)c + (1 - ij)be. 

This too can easily be verified by directly computing the (1, 3)th cofactor of 
the matrix in Eq. (15-5). Thus the gain 

*3 _ ^ 1 3 

y l yi=0 A 

is obtained purely by graph-theoretic computation. The result can be easily 
verified by inverting the corresponding matrix C as given in Eq. (15-5). 

Remarks and References 


The chief advantage of using signal-flow graphs (over substitution method 
or matrix method) lies in their ability to highlight the cause-effect relation¬ 
ships in the system. For example, the feedback edges shown are indeed the 
feedbacks in the actual system. A signal-flow graph can also be used very 
effectively for simplifying the system of equations before solving through 
matrix methods. The simplification is accomplished by flow-graph reductions. 
As can be seen from the literature cited in the next section, signal-flow graphs 
have been widely applied in the study of Markov systems. 

For a computerized solution of a general problem, perhaps matrix 
methods would be faster. For in using Mason’s gain formula, an important 
step is the generation of all directed circuits in the signal-flow graph; and as 
we saw in Chapter 11, the algorithms (suitable for computers) available for 
this are not very efficient. 

Several variations of signal-flow graphs have been proposed and studied 
in the literature, following Mason’s pioneering papers, [15-4] and [15-5]. For 
a survey of different variations, see [15-1]. Some of these variations may offer 
slight computational advantage, but they do so by sacrificing the cause-effect 
relationship, which is so nicely brought out in signal-flow graphs, as presented 
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here. Consequently, not much has come out of these other types of graph 
representations, and Mason’s original graphs are widely used in control 
theory, electrical network analysis, electrical machine theory, heat transfer, 
and analysis of mechanical structures. 

An elementary but thorough treatment of signal-flow graphs with many 
applications can be found in any of the following three monographs: [15-1], 
[15-3], and [15-6], All three use the original signal-flow graphs as proposed by 
Mason, without the later variations. 

15-1. Abrahams, J. R. and G. P. Coverley, Signal Flow Analysis, Pergamon Press, 
Inc.,Elmsford, N.Y., 1965. 

15-2. Ghosh, S. N., and P. K. Ghosh, “Flow Graphs and Linear Systems,” Intern. J. 
Control, Vol. 14 No. 5, Nov. 1971, 961-975. 

15-3. Lorens, C. S., Flowgraphs: For the Modeling and Analysis of Linear Systems, 
McGraw-Hill Book Company, New York, 1964. 

15-4. Mason, S. J., “Feedback Theory: Some Properties of Signal Flow Graphs,” 
Proc. I.R.E., Vol. 41, No. 9, Sept. 1953, 1144-1156. 

15-5. Mason, S. J., “Feedback Theory: Further Properties of Signal Flow Graphs,” 
Proc. I.R.E., Vol. 44, No. 7, July 1956, 920-926. 

15-6. Robichaud, L. P. A., M. Boisvert, and J. Robert, Signal Flow Graphs and 
Applications, Prentice-Hall, Inc., Englewood Cliffs, N.J., 1962. 


15-2. GRAPHS IN MARKOV PROCESSES 

The simplest random process is one in which the outcomes of successive 
trials are independent of each other. In a coin-tossing experiment, for 
example, the outcome of the kih tossing is independent of the outcome of all 
previous tossings. Many phenomena, however, cannot be described by this 
simple model. There are random processes in nature in which the outcome 
depends on the outcome of previous trials. For example, the probability of 
an offspring inheriting a genetic feature does depend on the presence (or 
absence) of this feature in his ancestors. 

A Markov process is the simplest generalization that permits the outcome 
of any trial to be dependent on the outcome of the trial immediately preceding 
it, and on no other.! This simple but powerful generalization gives the 
Markov process an ability to describe random processes in such diverse areas 
as statistical information theory, control theory, genetics, inventory control, 
analysis of computer proprams, and the study of social mobility of different 
classes, to name a few. 

fThis model originated by Andrei Andreivich Markov (in 1907) is a landmark in 
probability theory. Unlike previous mathematicians, who had modeled a random process 
as a sequence of independent trials, Markov saw the advantage of introducing dependence 
of each trial on the outcome of its predecessor. Attempts have of course been made to 
study models with more involved dependency of the present trial on the outcome of the 
past trials, but such studies generally have led to intractable results. 
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A Markov process is a stochastic system capable of assuming one of n 
states s l ,s 2 ,..., s n , and the states change only at discrete points in time. The 
state at the Arth instant depends only on the state of the (k — l)th instant and 
not on any of the previous states. In other words, in a successive sequence of 
trials the outcome of the kth trial depends only on the outcome of the 
(k — l)th trial, and not on any of the preceding ones.t 

Transition Probabilities: To describe a Markov process, we must specify 
for each state, $., the probability of making the next transition to each of the 
n states. The transition probability p u is the probability that if the present 
state of the process is s n the next state will be Sj. These probabilities, p u , 
must satisfy 

0 < Pu < 1 and £p tJ = 1, (15-8) 

i= l 

the latter because the sum of probabilities of transitions to all possible states 
from a given state must be unity. (Note that we have assumed that the 
transition probabilities are constants, and do not vary with time. Such a 
process is called a stationary process or a time-invariant process. t). 

Transition Matrix: The n 2 transition probabilities describing a Markov 
process can most conveniently be given in the form of an n by n transition 
matrix P = [p i} \, subject, of course, to the two conditions in Eq. (15-8). Any 
square matrix with real, nonnegative elements in which the sum of each row 
is 1 is called a stochastic matrix. Thus every stochastic matrix is the transition 
matrix of some Markov process, and vice versa. Let us look at some pro¬ 
perties of a stochastic matrix P. 

1. If P is a stochastic matrix, its &th power P fc is also a stochastic matrix, 
for k = 0, 1,2,... (matrix P° = I, the identity matrix). 

2. If all rows of P are identical, then 

P = = ps = P 4 = .... 


3. Since each row of a stochastic matrix adds up to 1, only n — 1 columns 
need be given: the remaining column can be derived from them. 


In addition to the transition matrix, we also need to know the initial 
probabilities 


tt( 0) = [7t,(0), ?c 2 (0), . . ., 7I rt (0)], 


t A Markov process is often called a Markov chain if the number of states is countable. 
A finite Markov chain is one in which the number of states is finite. In this book we are 
considering only stationary, finite Markov chains in which the time also changes in dis¬ 
crete steps (and not continuously). As there is no possibility of confusion, such a process 
will simply be referred to as a Markov process hereafterwards. 
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where 7r y (0) is the probability of the Markov process being in state Sj at the 
start, that is, at the time instant 0. Clearly, the probabilities in rc(0) must 
satisfy the following conditions: 

For i = 1, 2, . . . , n, 

0 < 71,(0) < 1 and 7i,(0) = 1. (15-9) 

( = 0 

Any real-valued vector whose components satisfy the conditions in Eq. (15-9) 
is called a probability vector. The initial probability vector ir(0) and the transi¬ 
tion matrix completely determine a Markov process. They are sufficient 
to predict the probability of the process being in any state at any time 
instant k. 

Stochastic Graph: An alternative means of describing an «-state Markov 
process is an ^-vertex, weighted, connected digraph G. The vertices of G 
correspond to the states, and an edge (s n Sj ) with a nonzero weight p u 
represents the nonzero transition probability from state s { to s r Such a 
digraph, called a transition graph, is not only of great value in visualizing a 
Markov process, but is also a powerful analytic tool in studying the process. 
Clearly, the weights of the edges in the transition graph G must satisfy the 
conditions in Eqs. (15-8). A digraph in which the edge weights are positive 
quantities and the sum of weights of edges emanating from a vertex is unity is 
called a stochastic graph. 
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Fig. 15-4 Transition matrix and diagram for a Markov process. 
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As an example of a Markov process, let us consider the following version 
of the classic problem of random walk: 

Suppose that a particle moves, according to a probabilistic mechanism, 
along a straight line n = 2m units long between two walls. At each transition 
the particle moves either one unit left or one unit right, each with the proba¬ 
bility If the particle hits the left wall, it gets reflected; but if it hits the right 
wall, it is absorbed into the wall. 

At a given time the process is in one of n — 2m states (i.e., the particle is 
at one of the n points along the line). Only the present state is relevant to what 
the next state may be and not any of the previous states. This is an w-state 
Markov process. For n — 6, the transition matrix and the transition diagram 
for this process are given in Fig. 15-4. 

Note the similarities between a Markov process and a sequential machine, 
discussed in Section 12-8. Both have a finite number of states, and in both the 
state transitions occur at discrete points in time. The main difference between 
the two is that in Markov processes we deal with probabilities, which are 
real-valued quantities instead of a set of symbols, as in the case of sequential 
machines. Transitions in a Markov process do not depend on any (externally 
controlled) inputs, but are governed by probability distributions. Moreover, 
there are no outputs associated with a Markov process. 

Multistep Transition Probabilities 

An important question regarding a Markov process is the following: 
Given that the process was initially in state s t , what is the probability of its 
being in state s } after exactly k transitions? This probability 0 jV (&), called the 
k-step transition probability from state s t to Sj, is given by 

Theorem 15-4 

In a Markov process the &-step transition probability <f> t j(k ) from state s { to 
Sj is equal to the ijth entry in matrix ? k , the &th power of the transition matrix P. 

Proof: Consider the transition digraph. The weight p tJ of edge (s { , Sj ) is the 
probability of going from vertex s { to Sj in one step. Since the transition probabilities 
at each step are stochastically independent, the product 

Pir'Prj 

gives the probability that the process will go from state s t to s r in the first step and 
then from s r to Sj in the second step. Continuing this argument, the probability of 
going from vertex s t to Sj along a directed edge sequence of length k is given by the 
product of the weights of these edges. But the probability of going from s t to Sj 
in exactly k steps is the sum of the probabilities of going from $,• to Sj along all 
directed edge sequences of length k in the transition digraph. That this sum is 
given by the ijth entry in matrix P* can be easily seen by arguments used in 
Theorem 9-10. ■ 
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To illustrate Theorem 15-4, let us consider a three-state Markov process 
whose transition digraph is given in Fig. 15-5. The transition matrix and some 
of its powers are 



'0 

o r 



".5 

.5 0 " 



p = 

.4 

.6 0 

. i 

P 2 - 

.24 , 

.36 .4 

5 



.5 

.5 0 _ 



.2 

.3 . 5 _ 




".2 

.3 

.5 1 



'.3700 

.4300 

. 2000 ' 

R3 = 

.344 

.416 .24 

> 

P 4 = 

.2864 

.3696 

.3440 


.37 

.43 

.2 



.2720 

.3580 

.3700 



'.3099 

.3863 

.3038 




'.3076 

.3848 

.3076" 

pi 0 _ 

.3069 

.3841 

.3090 

> 

pi 5 = 

.3076 

.3846 

.3078 


.3064 

.3837 

.3099 



.3076 

.3845 

.3079 





".3077 

.3846 

.3077" 



P 17 = 

pi 8 _ 

pi9 = p 

20 _ 


3077 

.3846 

.3077 

• 






.3077 

.3846 

.3077 




The iyth entry in P 4 , for example, is the probability that the process will go 
from state s i to s ; in exactly four steps. Let us, for instance, examine all 
directed edge sequences of length four from s 3 to s 2 in Fig. 15-5. These are 

s 3 s 2 s 2 s 2 s 2 with probability of traversing (,5)(.6) 3 = .108, 

J35 2 j, 5 3 5 2 with probability of traversing (.5) 2 (.4)(1) = .1, 

s 3 s{S 3 s 2 s 2 with probability of traversing (.5)(1)(.5)(.6) = .15. 

The sum of their probabilities, .358, is exactly the entry in the (3, 2) position 
ofP 4 . 

For this example, let us make some further observations on the properties 
of matrix P*. 

1. Beyond a certain value of k, ? k contains only nonzero entries. This 
implies that there is at least one directed edge sequence of length k 
(and therefore a directed path of length k or less) from every vertex to 
every other vertex in Fig. 15-5. 
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2. All rows of P* tend to become identical as k increases. This means that 
the A:-step transition probability 0 , 7 (/c) becomes independent of i for 
large k. This result should not come as a surprise, because the effect of 
the starting state s. should wear off after sufficiently many transitions. 

3. As a direct consequence of item 2, the higher powers of P become 
identical; that is, 

p* __ p. p k _ p. p*+1 _ ... 

because P is a stochastic matrix and P* has identical rows. 

Does P* of every Markov process exhibit these properties, or is this 
example a special case? To answer this question, let us take a closer look at 
stochastic digraphs, and try to classify them. 

Classification of States 

A set S of states is said to be closed, trapping, or absorbing if no state 
outside S can be reached from any state s t in S. In other words, there is no 
directed edge from any vertex j. in S to any vertex outside S. For example, 
[j 4 , s s , s 3 } in Fig. 15-6 is a closed set of states. So is { 54 }. A single state s k is an 
absorbing or trapping state if and only if it has a self-loop with weight one, 
such as s A in Fig. 15-6. Clearly, the entire set of states in a Markov process 
trivially constitutes a closed set. If there exists no other closed set of states 
except the entire set of states of the Markov process, the process is called 
ergodic or irreducible. In other words, a process is ergodic if and only if its 


A closed but not ergodic set of states 



Fig. 15-6 Nonergodic Markov process showing closed sets. 
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transition graph is strongly connected; that is, there is a nonzero probability 
of going from any state to any other state. For example, in Fig. 15-5 the 
process is ergodic, but the process shown in Fig. 15-6 is not. A strongly con¬ 
nected closed set of states is called an ergodic or irreducible set. 

Regular Process: Of special interest among ergodic Markov processes are 
those in which there exists a directed edge sequence exactly of length k (for 
some positive integer k ) from every vertex to every other vertex in G. Such a 
process is called a regular Markov process. Clearly, every regular process is 
ergodic, but the converse is not true. For example, in Fig. 15-7(a) the process 
is ergodic but not regular. For there is no directed edge sequence of even 
length from s , to s 2 and no directed sequence of odd length from s t to s 3 ; thus 
there exists no k for which there is a directed edge sequence of k edges from 
every vertex to every other vertex. 


s 2 1 

•->-• 


+ -- + 

Si 1 s 4 

(a) 




(b) Regular 


1 




(c) 



Fig. 15-7 Some ergodic stochastic digraphs. 

The significance of a regular Markov process lies in the fact that for some 
integer k there is a nonzero probability of going from each state to every other 
state in exactly k steps. In terms of the &>step transition matrix P* = <£>(£:), 
we can make the following statement: A Markov process is regular if and only 
if for some integer k every entry in the &-step transition matrix <&(k) is posi¬ 
tive. 

Given the transition digraph G of an ergodic process (i.e., G is strongly 
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connected), how can we tell if the process is regular? To answer this question, 
let us introduce the following definition: A strongly connected subdigraph g 
in a digraph G is said to be a minimal if g has no proper subdigraph of two or 
more vertices that is strongly connected. 

Theorem 15-5 

Let G be a strongly connected stochastic digraph, and let g i9 g 2 ,. . . , g r be its 
minimal strongly connected subdigraphs, having n u n 2 ,. . . , n r vertices, respec¬ 
tively. Then G represents a regular Markov process if and only if the greatest 
common divisor (g.c.d.) of n x , n 2 ,. . ., n r is 1. 

For a proof of this theorem, see Rosenblatt’s paper [15-14]. Let us simply 
illustrate it with some examples. The digraph in Fig. 15-7(c) has two minimal 
strongly connected subdigraphs, with vertex sets [s 3 , ,y 4 } and [$,, s 2 , s 3 , $ 4 }. 
Since the g.c.d. (4, 2) = 2, the process is not regular. On the other hand, Fig. 
15-7(f) also has two minimal, strongly connected subdigraphs, one with five 
vertices and the other with three. Since the g.c.d. (5, 3) = 1, the process is 
regular. 

Clearly, if a strongly connected transition digraph contains a self-loop, 
the g.c.d. is 1; therefore, the process is regular [e.g., Fig. 15-7(d)]. The reader 
is encouraged to write down the transition matrix P for each of the six 
digraphs in Fig. 15-7, and verify Theorem 15-5 by directly computing P fe 
(for appropriate k). 

Periodic Markov Process: An ergodic process is said to be periodic if every 
state can only be entered at certain periodic intervals. The simplest example 
of a cyclic process is one with two states s l and s 2 in which only the transitions 

— > s 2 —► Sy —► s 2 . . . are possible. The transition matrix P and its powers 
for this process are 
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As another example, consider the process in Fig. 15-7(c). Its transition ma¬ 
trix is 
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for k odd and very large. 
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" 0*0 

T 0 * 

040 

-i 0 i 


r 

0 

i 

0 


for k even and very large. 


A Markov process is periodic if and only if its states can be partitioned 
into q subsets (q > 1) such that the process dwells in each of these q subsets in 
q consecutive transitions. 

It can be shown that every ergodic process is either regular or periodic 
[15-11]. 

Markov Processes with Transient States: So far we have been considering 
ergodic Markov processes (i.e., those in which the transition digraphs are 
strongly connected). Let us now examine the processes for which the transi¬ 
tion digraph is weakly connected. A weakly connected digraph G consists of 
two or more fragments (i.e., maximal strongly connected subdigraphs). 

Let us consider a fragment g of G (remember fragment g could be a single 
vertex). If there is no edge directed out of g, then g has to have at least one 
edge going into it, and the vertices in g are closed. In that case, we can delete 
all edges going into g, and then study g independently as an ergodic process 
[the edge weights will satisfy Eqs. (15-8)]. On the other hand, if there is an 
edge going out of g, the vertices of g cannot constitute an ergodic process, 
because (g being maximal strongly connected) once exited, g cannot be 
reentered. Such a set of states, which once left cannot be entered and which 
among themselves are accessible from each other, is called a transient set of 
states. Sets [s { , s 2 , 5 6 ) and [j 3 , s 5 ] in Fig. 15-6 are transient sets. 

The vertices of a weakly connected stochastic digraph can be uniquely 
partitioned into sets T, V u V 2 , , V q such that T is the set of all transient 

states and each V i is irreducible, that is, there is no edge ( a , b) for a e V ; , 
b V., and the vertex set V. is strongly connected. For example, in Fig. 15-6 


T ={s it s 2l s 6 ,s it s j], K, ={j 7 ,j 8 }, V 2 = {j 4 }. 


Clearly, a (weakly connected) stochastic digraph cannot consist of set T 
alone—there must be at least one set V r In other words, it is not possible that 
all states of a Markov process are transient (remember we are only consider¬ 
ing finite Markov processes). 

After a large number of transitions, a Markov process with transient 
states will eventually settle down into one of its irreducible subsets. Such a 
system has two types of distinct behavior, and one may be interested in either 
or both: (1) the behavior of the system before it enters an irreducible set of 
states, and (2) the behavior of the system after it enters an irreducible set. 
Behavior 2 is no different from that of an ergodic system. For once the system 
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enters an irreducible set, it can never leave it, and thus the existence of states 
outside this set is immaterial. Behavior 1 will be studied briefly in Section 
15-3 while analyzing computer programs. 

Thus as far as the asymptotic behavior of Markov processes is concerned, 
we need to study only ergodic processes. Among ergodic processes, also, only 
the regular processes are of importance.f Therefore, in the rest of this section 
we shall study the asymptotic behavior of a regular Markov process. 

Asymptotic Behavior of a Regular Markov Process 

One of the most important questions about a Markov process is what 
happens to it after many many transitions? That is, after the transients die 
down, does the system reach a steady state, independent of the initial proba¬ 
bilities? If so, what is the steady-state probability vector jc(oo) and how do we 
compute it? The answer lies in the behavior of P* as k tends to infinity, and 
Theorem 15-6 provides it for a regular Markov process: 

Theorem 15-6 

If P is a transition matrix of a regular Markov process, then its powers, P*, as 
k tends to infinity, approach a stochastic matrix O = [0, 7 ] having identical rows, 
and each row w of O is a probability vector. 

Outline of the Proof: We first note that since the process is regular there exists 
some positive integer r such that P r = M contains only positive entries. Second, 
we observe that premultiplying any column vector y by a stochastic matrix having 
only positive entries has an averaging effect on the elements of y. This averaging 
effect applied again and again would eventually smooth out differences that may 
have existed among the elements of y. That is, all components of vector M A y 
= P rA y = P A y will have identical elements, as h (and therefore k ) becomes very 
large. Finally, let us observe that this condition is equivalent to P* approaching 
a limit O as k tends to infinity, and the rows of matrix O are identical—each a 
probability vector. The reader is encouraged to fill in the details. ■ 

Theorem 15-6 is perhaps the most important result in the theory of 
Markov processes. Many interesting and useful results for a regular Markov 
process depend on the existence of this limit for P fc , as A: —► oo. 

Since P 50 (which is a shorthand notation for lim P* as k — * oo) exists, we 
can express the ste&dy-state probability 

«(oo) = jt( 0)P~ - jr(0)0>. (15-10) 

fTo quote Feller, “The classification into persistent and transient states is fundamental, 
whereas the classification into periodic and aperiodic states concerns a technical detail. 
It represents a nuisance in that it requires constant reference to trivialities;” W. Feller, 
An Introduction to Probability Theory and Its Applications, Volume I, 3rd ed., John Wiley 
& Sons, Inc., New York, 1968, 387. 
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Writing this equation in terms of its elements, 


7Zj(oo) = 2 7i i (0)(f) i j, j = 1, 2,..., n. (15-11) 

i = 1 

Now, since all rows of O are identical, each element $ i7 is equal to a value 
Wj that depends only on the column index /. Thus 


7lj(°o) = 2 7T,-(0) Wj 
1 = 1 

= Wj Z «X0) 

i = 1 



(15-12) 


Thus a regular Markov process approaches the same limiting probability 
distribution w regardless of where it started. Moreover, this final probability 
vector, it(oo) — w, is the one that appears as rows of matrix <]>. 

For a given regular Markov process, how does one compute this fixed 
vector w, that is, the vector that makes up all the rows of O and is also equal 
to jc(oo)? Several methods are available. Raising the transition matrix P to 
higher and higher powers is one, but it is obviously not a good method. A 
most frequently used method is the following: 

Successive state probability vectors must satisfy 


ir (k + 1) = n{k)?. 


(15-13) 


If the state probability vector has attained its limiting value jc(oo), it must 
then satisfy 

jj(oo) — jt(oo)P, (15-14) 

and since ir(oo) = w, according to Eq. (15-12), it can be rewritten as 


w = wP. (15-15) 

Equation (15-15) implies n simultaneous equations, which can be rewritten as 

w(l — P) = 0. (15-16) 

But since P is a stochastic matrix, the sum of rows of matrix I — P is 
zero. Therefore, I — P is a singular matrix; that is, the n equations in Eq. 
(15-16) are not linearly independent. It can be shown, however, that any 
n — 1 of these equations are linearly independent, and thus we need only one 
more equation to solve for the n unknowns in the row vector w. This is readily 
provided by the relation 


Wj + vv 2 + • • • + W n — 1. 


(15-17) 
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Thus in Eq. (15-16) if we replace any one column, say theyth, of the matrix 
I — P on the left-hand side and change theyth entry from 0 to 1 in the right- 
hand side, we would incorporate Eq. (15-17) into Eq. (15-16). Let this new 
equation be denoted by 

w(l — P)j = \j, (15-18) 

where v y is a row vector of length n with all zero entries except theyth, which is 
1. Equation (15-18) can be solved by either directly inverting the matrix 
(I — P)* or by using signal-flow graphs, as shown in Section 15-1. For 
illustration, let us consider the three-state regular Markov process given in 
Fig. 15-5. Applying Eqs. (15-15) and (15-17) directly, we get 


which is 


Hence 
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W 3 ) 
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w 2 w 3 ). 


w, = ,4w 2 + .5w 3 , 
h ’ 2 = .6 w 2 + .5w 3 , 
w 3 = w,. 
w j — w 3 — .8 w 2 , 


which combined with 


W l + W 2 + W 3 = h 


immediately yields the limiting probability vector w = (-^j ^ ytj). This 

indeed is the result we had obtained earlier by trying 17th and higher powers 
of P. 

Solving the same problem using the form of Eq. (15-18) gives 
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Replacing any one of the columns, say the second, with all l’s we get 
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Finally, 


w = (I — P)f~‘(0 1 0) 

= (A tt A) 


which checks with the value of w obtained earlier. 

Instead of solving Eq. (15-18) by algebraic inversion of matrix (1 — P)f, 
often it is more efficient to use a signal-flow graph to solve Eq. (15-20), 
particularly when matrix P is relatively sparse. 

The signal-flow graph corresponding to Eq. (15-18) is directly obtained 
from the transition graph of the process by a simple modification: 


1. Replace the vertex labels j.’s with w.’s. 

2. Remove all edges incoming into the specified vertex Sj. 

3. Put edges of weight — 1 from every other vertex to s r 

4. Reverse the direction of every edge. [Inverting the edge direction corre¬ 
sponds to transposing the matrix—as required by Eq. (15-4).] 

5. Add a new vertex with label 1 to the digraph and draw an edge of 
unit weight from this new vertex to Sj. 

In this signal-flow graph we can obtain the gain from v to every vertex, which 
will give the elements of w. 

w, 


Fig. 15-8 Signal-flow graph for a 
w 3 -1 w 2 1 1 Markov system. 



Once again, modifying Fig. 15-5 we get the appropriate digraph, as shown 
in Fig. 15-8. For this signal-flow graph [using Eq. (15-5)] the determinant is 
given by 

A = 1 — t x t 2 — t 3 + ••• 

= 1 - 

= 1 - (.5 - .4 - .4) = 1.3, 
and the three relevant cofactors needed in Eq. (15-6) are 

Aj 2 .4, 

A 2 2 = -5, 

A3 2 = - 4 , 


which again checks with the results obtained earlier. 
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Medvedev [15-12] has a different method of obtaining vector w from the 
signal-flow graph, and through some examples he has shown that there are 
cases when graph-theoretic methods are superior to algebraic ones. Another 
computational formula using transition digraphs is given in [15-7]. For more 
on computing vector w via transition graphs, see [15-8]. 

Transient Analysis of a Markov Process 

Matrix $ = P~ for a regular Markov process gives us the steady-state 
distribution, but it does not reveal the transient behavior of the process. It 
does not tell us how fast P* converges to the limit <I>, nor does it give the 
frequencies with which various states in the system were visited before the 
steady-state condition was reached. The answer to these questions lies in 
the behavior of the sequence P, P 2 , P 3 , P 4 ,.... The problem is then to find a 
closed-form expression for matrix P* (as a function of k). This can be 
obtained using z-transforms: 

From Eq. (15-13) we know that 

n(k + 1) = n{k)P. 

Taking the z-transform of both sides of this equation, we get 

P(?) _ r _ g(Q . ) = n(z)P, (15-19) 

z 

where Il(z) is the z-transform of n{k). Rearranging Eq. (15-19), we get 

n(z) = jt( 0)(I - zP)->. (15-20) 

Application of inverse z-transform to both sides of Eq. (15-20) gives 

n{k) = jt( 0) [inverse z-transform of matrix (I — zP) -1 ]. (15-21) 

Thus 

P* = inverse z-transform of matrix (I — zP) -1 . (15-22) 

Equation (15-21) gives the probability vector after k transitions have taken 
place, for A- = 1,2,...., and Eq. (15-22) provides the value of P*, as a func¬ 
tion of k, in a closed form for any Markov process (not necessarily regular). 

Matrix I — zP can be inverted by direct matrix methods, but the flow- 
graph method, as used for inverting (I — P)* in Eq. (15-18), is often found to 
be more efficient. The weight of each edge p u is now multiplied by z. Infor¬ 
mally speaking, z is the transform of the unit delay, and multiplying each 
transition probability by z corresponds to the delay associated with each 
transition. For a thorough treatment of transient analysis of a Markov 
process via signal-flow graphs, see Chapters 3 and 4 of [15-9]. 
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For illustration, let us consider the Markov process depicted in Fig. 15-5. 
we have 
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(15-23) 


Let us now compute the inverse (I — zP) -1 by a signal-flow graph rather 
than algebraically. The signal-flow graph is nothing but the transition graph 
in which each edge is multiplied by z, and the direction of every edge is 
reversed. See Fig. 15-9, which is obtained directly from Fig. 15-5. 


s i 



Fig. 15-9 Signal-flow graph corre¬ 
sponding to Fig. 15-5. 


From Fig. 15-9 we immediately obtain the cycle product terms [as used in 
Eqs. (15-6) and (15-7)] as follows: 


t x — .6 z + ,5z 2 + .2z 3 , 
t 2 = .3z 3 , 

determinant A = 1 — t x + t 2 

= 1 - .6z - .5z 2 + .lz 3 . 

A cofactor 

A 13 = .5z(l — .6z) + .2z 2 
= z(.5 — .lz), 


which is in agreement with the 3,1 entry in Eq. (15-23). 



sec. 15-3 


GRAPHS IN COMPUTER PROGRAMMING 


439 


Now taking the inverse z-transform of both sides of Eq. (15-23), we get 
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Remarks and References 

As the brief exposition in this section shows, weighted digraphs enter very 
naturally into the study of finite Markov chains; in particular, the asymptotic 
behavior of a Markov chain can be derived swiftly from its digraph. A great 
deal more on Markov chains remains unsaid in this section. Recommended 
introductory readings on Markov chains are [15-11] and [15-13]. For 
z-transforms and applications of signal-flow graphs to the study of Markov 
chains, [15-9] is an excellent source. The following is a list of selected reading 
for more on the interaction of Markov processes and graph theory. 
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15-3. GRAPHS IN COMPUTER PROGRAMMING 


Analysis of a given computer program has been an important problem 
from the early days of computer programming. The purpose of such an analy¬ 
sis could be to estimate the running time or storage requirement of a program. 



440 


SURVEY OF OTHER APPLICATIONS 


CHAP. 15 


to subdivide a large program into a number of subprograms, to detect certain 
types of structural errors in the program, to document a program, or simply to 
understand a program written by someone else. For all these purposes it is 
very convenient to represent a program as a digraph. Each vertex represents 
a program block , that is, a sequence of computer instructions having the 
property that each time any instruction in the sequence is executed all are 
executed. Each program block has one entry point (the first instruction in the 
sequence) and one exit point (the last instruction in the sequence). Each edge 
(v p Vj) represents a possible transfer of control from the last instruction in the 
program block v t to the first instruction in the program block v } . Such a 
digraph is called a program digraph. A program digraph can also be thought 
of as an abstraction of a flow chart in which the boxes are shrunk to vertices 
and arrows become the edges. For example, Fig. 15-10 shows the program 
digraph of the flow chart in Fig. 11-9. [Ignore dashed line (v 14 , v x ) for now.] 


Start 



Fig. 15-10 Program digraph of Fig. 11-9. 

Note that there may be more than one program digraph for a given 
program, because one program block may be split into several. Also observe 
that programs that modify their own control and processing instructions in 
the course of execution cannot be represented in this fashion. (Declarative 
statements, such as formats and dimension statements, are ignored in pro¬ 
gram digraphs.) 
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Some obvious but important properties of a digraph representing any 
valid computer program are 

1. A program digraph must be connected. 

2. It has precisely one vertex of zero in-degree, and this vertex corre¬ 
sponds to the start of the program. (If the program has several starting 
points, an additional start vertex can be introduced from which directed 
edges can be drawn to all of these vertices.) 

3. There is precisely one vertex of zero out-degree, and this vertex corre¬ 
sponds to the end of the program. We shall call it the stop vertex. 
(Existence of more than one stopping point in the program can be taken 
care of as in item 2.) 

4. Every vertex in the program digraph must be accessible from the start 
vertex. 

5. The stop vertex must be accessible from every vertex in the program 
digraph. 

Detection of Programming Errors 

To detect and report certain types of structural errors in a source program 
is an essential part of a compiler’s job. The most common of these errors can 
be checked by tracing directed paths from the start vertex to the stop vertex 
in the program digraph. Path tracing will detect program blocks that are 
never entered, or program blocks from which there is no exit leading to the 
stop vertex, or disjoint parts, such as a subroutine that is never called [15-33]. 

Estimation of Program Running Time 

Given a computer program and the execution time for each of the program 
blocks, we are often required to estimate the running time of the program. 
The situation is represented by a program digraph having a weight t. 
associated with each vertex v n where t t is the execution time of the corre¬ 
sponding program block. If we can estimate the number of times each vertex 
is entered (i.e., the program block is executed), the running time of the pro¬ 
gram can be determined (for a particular computer, of course). 

In the program digraph let each directed edge e ; be assigned a nonnegative 
integer/., where f t . is the number of times edge e ; is traversed. The number of 
times a vertex is entered must equal the number of times it is exited, except 
for the start vertex and the stop vertex. These two exceptions can also be taken 
care of by adding an edge of unit weight directed from the stop vertex v n to 
the start vertex v x (see the dashed edge in Fig. 15-10). Now the edge weights 
in this modified digraph satisfy the Kirchhoff current law (KCL) at every 
vertex. (Quantity / may be looked upon as a flow through the /th edge e r 
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Recall the electrical networks in Chapter 13 and the flow networks in Chapter 
14.) 

If we apply KCL to a weighted, connected digraph of n vertices and e 
edges, we get n — 1 linearly independent equations in e unknowns, the 
unknowns being the weights of (i.e., the flows through) the edges. Thus we can 
choose independently only a set of n = e — n + 1 flows, corresponding to 
the chords with respect to any spanning tree in the digraph. The remaining 
n — 1 flows through the tree branches can be expressed in terms of these n 
flows. For example, in the program digraph of Fig. 15-10, ji — 18 — 14 + 1 
= 5. Therefore, the flow through every edge can be expressed in terms of five 
unknowns. Thus the iteration counts through 14 boxes in the flow chart are 
expressed in terms of only five unknowns. These five unknowns can also be 
chosen conveniently by picking a spanning tree. For example, in Fig. 11-9 we 
know that the iteration count of the edge (v 4 , v n ) is N — 1, where N is the num¬ 
ber of vertices in the original graph on which Algorithm 6 was being applied. 
The flows through the remaining four chords are data dependent in a more 
involved fashion. The minimum, maximum, and average running times of 
the program can be estimated by assuming appropriate probable values of 
these four unknowns. At this point the problem of estimating the running 
time becomes quite difficult. For some simple examples, the reader is referred 
to [15-25], [15-26], and pages 95-102 and 364-369 of [11-39], 

In this connection an interesting question is as follows: 

Given a connected digraph G with vertices v x , v 2 , . . . , v n and e edges, 
having weights /j,/ 2 ,. . . ,f e associated with the edges, what are necessary and 
sufficient conditions that G corresponds to some program digraph such that 
v x is the start vertex and v n is the stop vertex? Some necessary conditions are 
obvious: Each f. must be a nonnegative integer. The in-degree of v x — 0 = 
out-degree of v n . Also, the/.’s must satisfy KCL at each vertex except at v x 
and v n . Moreover, the sum of weights of edges going out of v x should be equal 
to the sum of weights of edges going into v n , both being equal to unity. Are 
these conditions sufficient also? The answer, as given by the following 
construction, is yes. 

From the given, weighted digraph G let us construct an unweighted 
digraph H as follows: Replace every edge e,. with f ; parallel edges, where is 
the weight of the edge e } in the digraph G. Clearly, the digraph H will be 
balanced [i.e., d + (v) — d~(v) for every vertex v in H] if and only if KCL is 
satisfied at every vertex in G. Now, from Theorem 9-1, a digraph is balanced 
if and only if it is an Euler digraph; that is, there exists a directed Euler walk 
from v x to v n in H. This is possible if and only if there exists a directed edge 
sequence in G from vertex v x to v n such that every edge e k appears in it exactly 
f k times, and the edge (v n , v x ) does not appear in this directed edge sequence. 
The last statement is equivalent to G being a program digraph with v x as 
start vertex and v n as stop vertex. Thus we have. 
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Theorem 15-7 

Let G be a connected, weighted digraph with n vertices and e edges. Let all 
the edge weights/i,/ 2 ,. . . ,f e be nonnegative integers, and such that they satisfy 
KCL at each vertex, except v x and v„. Furthermore, let the in-degree of v x = 0 
= out-degree of v n , and the sum of the weights of the edges going out of v x = the 
sum of the weights of the edges going into v„ = 1 . Then G corresponds to a pro¬ 
gram digraph in which v x is the start vertex, v„ is the stop vertex, and the weight 
ft of the ith edge is the number of times that edge is traversed in the program. 

Program Segmentation 

Sometimes one comes across a program so large that it cannot be accom¬ 
modated in its entirety into the working memory of the available computer. 
In such a case the program must be segmented before execution. Then the 
segments (pieces) of the program are brought from the slow bulk memory 
(drum, disk, or tape) and executed one at a time. The size of each segment 
must be small enough to be accommodated into the working memory and yet 
must be large enough so that there would not be too many transfers between 
the fast working memory and the slow bulk memory. Thus we have a problem 
of finding an optimal partitioning of the program digraph into subdigraphs 
such that the sum of weights of vertices (here the weight s t of vertex v ( is the 
amount of storage space required by the ith program block) does not exceed a 
specified value. A similar problem arises in a multiaccess, time-sharing envi¬ 
ronment, where each user is given a burst of service of fixed duration. The 
program segments have to be chosen judiciously, not so large that its execu¬ 
tion will exceed the allotted time and yet not too short to require inefficient 
transfers between memories. 

If the program digraph is acyclic (i.e., the program has no loops, which is 
rare for any nontrivial computer program), the partitioning problem is solved 
easily. We sort the vertices in a topological order, and starting from the first 
vertex, we partition the sequence into largest possible (topologically sorted) 
subsequences such that the total vertex weight of no subsequence exceeds the 
specified value. 

The difficulty in segmentation arises because of directed circuits in the 
program digraphs (i.e., loops in the program). A cut made across a directed 
circuit implies interchanges between the two segments, and hence between 
fast and slow memory. Thus one would like to avoid a segmentation that 
causes vertices of one directed circuit to belong to more than one segment. 
The simplest method to accomplish this is to identify all fragments (i.e., 
maximal strongly connected subdigraphs) in the program digraph. Then 
compute the memory requirement of each fragment by adding the weights of 
all its vertices. If the largest of these memory requirements does not exceed 
the size of the working memory, then the problem is solved. For we need not 
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cut any directed circuit, and small fragments can always be combined to yield 
a segment of reasonable size. 

In practice, however, it is found that the largest fragment is usually too 
large to fit in the working memory. For example, in Fig. 15-10 almost the 
entire program digraph forms a fragment. In such a case, cutting across 
directed circuits is unavoidable. The simplest approach suggested in the 
literature is to find a partition that severs the least number of directed 
circuits. This is a difficult problem. To enumerate all strongly connected 
subdigraphs in the program digraph and then consider each as a possible 
segment, although suggested as a solution in the literature [15-32], is a hor¬ 
rendous task. Even generation of all directed circuits in a digraph is extremely 
time consuming, as we observed in Chapter 11. Another method suggested 
for program segmentation is by first ordering the vertices in a certain fashion 
[15-20]. This method of segmentation involves more labor than finding all 
fragments, but less labor than generating all strongly connected subdigraphs 
of the program digraph. 

Segmentation of a program is a very difficult problem, to say the least. 
We are quite far from having found a procedure for an efficient solution for 
this important and interesting graph-theoretic problem. 

Even if we were able to obtain a partitioning that minimized the number 
of severed directed circuits, the solution might not be optimal. Every directed 
circuit is not traversed the same number of times. Obviously, cutting a 
directed circuit with higher iteration count is worse than cutting one with 
lower iteration count. We must have the iteration count of each loop— 
information rarely available ab initio , because of its data dependence. A 
stochastic analysis of the program, assuming that it behaves as a Markov 
system, is often the answer to this problem. 

Stochastic Model of a Program 

One method used to estimate the Telative frequencies of traversal of 
different edges and vertices is to assume the program digraph to be a stochastic 
digraph, in which the weight p u of edge (v., v } ) is the conditional probabilityt 
that the program execution will go to program block Vj given that it has 
executed the program block v r Once the program reaches v n , the stop vertex, 
the probability of its branching to any other vertex is zero. To satisfy the 
conditions in Eq. (15-8), we add a self-loop of weight one at vertex v n . Thus 
v n is an absorbing state, the only absorbing state in the system, and the 
remaining vertices correspond to transient states. A very simple stochastic 
program digraph and its transition matrix P are shown in Fig. 15-11. 

fNote that these weights pif s have nothing to do with weights /*’s assigned in the 
previous analysis of the program digraph. While /*’s obey KCL at each vertex, pi/s obey 
Eq. (15-8). 
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Fig. 15-11 Stochastic program digraph and its transition matrix. 


The transition matrix P of any stochastic program digraph can be 
expressed in the form 


P = 


Q 

6 ’ 


T 

1 


(15-24) 


where Q is an (n — 1) by (n — 1) submatrix corresponding to the transient 
states, T is an (n — 1) by 1 column vector, and 0 is the row vector of n — 1 
zeros. 

Let us look at matrix P*, which represents the /c-step transition prob¬ 
abilities. Clearly, 


"Q* 

T'“ 

_ o 

1 


(15-25) 


(T is a matrix that we need not compute here.) 

The ijth entry in Q* is the probability of being in transient state v } after 
exactly k transitions from the starting state v, (also transient). Let us first 
show that Q* becomes 0 as k becomes large. 

In the stochastic program digraph G with n vertices, let p t be the proba¬ 
bility that starting from vertex v. the program will not reach v n stop vertex in 
n steps (or less). Since there exists at least one directed path (of length n — 1 
or less) with a nonzero path product from v t to v n , quantity Pi < 1. Let p be 
the largest of all p.' s. The probability of not reaching v n in n steps is less than 
p, in 2 n steps it is less than p 2 , and so on. Since p < 1, these probabilities tend 
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to zero. That is, the sum of the entries in the /th row of Q* as k —> oo becomes 
zero, for i = 1, 2, . . ., n — 1. Now since Q. k = 0 for some large k, we can 
write 

I + Q + Q 2 + • • • + Q* -1 = (I - Q)-\ (15-26) 

which can be easily verified by multiplying both sides with I — Q. Equation 
(15-26) says that matrix I — Q is nonsingular. For brevity, let us denote 
matrix (I — Q) _1 by R = [r tj ]. 

The sum of probabilities of reaching v ; from v, in 1 step, 2 steps, 3 steps, 
. . . , and k — 1 steps is equal to r tJ , the (/th entry in (I — Q) _1 , according to 
Eq. (15-26). This is precisely the average number of times vertex v } appears in 
the random paths starting from v.. In a program, since we always start from 
v l and end at v n , the first row of R gives us the average iteration counts of all 
n — 1 transient vertices. That is, 

r xj = the average number of times program block v } will be executed in a 
typical run. 

Matrix R = (I — Q)~\ besides giving the average number of occurences 
of different vertices, is a storehouse of a lot of other useful information about 
the transient behavior of the stochastic program digraph. For example, let 
h u denote the probability that the program will ever execute Vj having 
executed v r Clearly, 

r.j — h u • (average number of times v s occurs, given that the system 
started in Vj) 


Therefore, 

hij = jr' (* 5 - 27 ) 

r jj 

To extract another piece of information from matrix R, let f? } denote the 
probability that the program starting from v 5 will never return to v r Now, to 
pass r j} times through a vertex v } , the process must reach once and then 
return there r u — 1 times. Therefore, the probability of returning to Vj after 
leaving it once is given by 


rjj — 1 = |_L. 

r ij r n 


Hence the probability, of never returning to v } after leaving it once is 


A = r- 


r ij 


(15-28) 
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Finally, let denote the probability that v 3 is executed exactly k times 
(starting from the start vertex v x ). Then 

a, = (/*„•)(! - A,)*-W (15-29) 

For illustration let us continue with the example of Fig. 15-11. The Q 
matrix is 

i H o' 

„ 0 0 0 1 
Q — i 

0 } 0 | 

_0 0 $ 0 _ 

and matrix R = (I — Q) _1 comes out to be 



~1 

If 

IS 

7 -i 
TT 


0 

5 8 

3 

7 

Q)- 1 = 

3T 

TT 

TT 

0 

14 

1 4 

7 


33 

TT 

TT 


_0 

A 

A 

1 4 

ttJ 


Therefore, 

r 12 = H i s th e average number of times vertex v 2 gets executed, 

7 

A, 4 = “it ~ ? IS probability that v 4 will be executed, 

TT 

= {-^ is the probability that v 3 will never be executed given that it has 
just been executed. 

And so on. 

Having computed the average number of executions of each vertex v p one 
can immediately get the expected execution time for the entire program as 

t= ttfij, (15-30) 

7=1 

where t 3 is the execution time of the program block v r Equation (15-30) 
assumes that there is no parallel processing (i.e., no two program blocks are 
executed simultaneously). 

For the purpose of segmenting a fragment g in the program digraph, we 
can compute the least frequently used edge in g, delete it from the fragment, 
and check if the resulting digraph can be partitioned into appropriate size 
subfragments. If not, we remove the least frequently used edge in the 
remaining digraph. This process is continued till subfragment g is segmented 
into the required size subfragments. This is the stochastic segmentation 
procedure suggested in [15-33]. 
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The most difficult part of stochastic analysis of a program is the labor and 
inaccuracies involved in the evaluation of the transition matrix P, because 
the branching probabilities are data dependent. They can, however, be 
estimated by simulation methods using sample input data [15-33]. Another 
difficulty is that for many programs the assumption about the weights p./s 
being statistically independent is not valid. 

Remarks and References 

In analysis and design of application programs and system software you 
are likely to encounter graph theory more often than any other branch of 
mathematics. As we have just seen, a weighted digraph is a natural and useful 
representation of a computer program, and is of immense aid in timing 
analysis, segmentation, and in detecting certain common types of structural 
errors. In addition, there are other programming applications that were not 
discussed here. Some of these are 

1. Program optimization, [15-16]. 

2. Automatic flow charting, [15-19] and [11-4], page 245. 

3. Graphs as data structures, [15-34], 

4. Parallel-processing design and evaluation, [15-23]. 

5. In proving equivalence of two programs, or proving validity of a 
program by transforming the program digraph into canonical forms. 

The following list of papers is a sample of the growing literature on 
utilization of graphs in the art of computer programming. 
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15-4. GRAPHS IN CHEMISTRY 

Although Arthur Cayley used trees to represent the structures of organic 
molecules 100 years ago (and, indeed, much of the early interest in the study 
of trees was motivated by this representation), it is only recently that graph- 
theoretic techniques are coming into use for characterization and identifica¬ 
tion of chemical compounds. This is due to (1) the advent of the electronic 
computer with its ability to handle graphs, and (2) the ever-intensifying need 
of the chemist to have a mechanized information retrieval system capable of 
dealing with the millions of organic compounds known today. 

Given a chemical substance and some of its properties (such as molecular 
weight, chemical composition, mass spectrum, etc.), the chemist would like 
to find out if this substance is a known compound. If he is able to identify 
this compound, he may like to know some additional properties of the 
compound, or if the compound is “new” he would like to know its structure, 
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and then include it in the dictionary of known compounds. It is essential, 
therefore, to have a standard representation for a compound, and the 
representation must be compact, unambiguous, and amenable to classifica¬ 
tion. 

It was shown in Section 3-6 how a chemical compound can be represented 
by means of a connected graph, with the atoms as the vertices and the bonds 
between them as edges. For compactness the hydrogen atoms are omitted 
from the representation, as they are implied by every unused valence of the 
other atoms. For example, the structural graph of aminoacetone C 3 H 7 NO, 
with its H atoms stripped off, is shown in Fig. 15-12. [Recall that the valence 
for carbon (C) is 4, for nitrogen (N) it is 3, and for oxygen (O) it is 2.] 

O 


Fig. 15-12 Structural graph of amino- 
C C C N acetone. 

The structural graph of a chemical compound, in general, contains much 
more information than the molecular formula does. For example, the 
molecular formula C, 0 H 22 can denote any of its 75 isomers (75 being the 
number of unlabeled trees with 10 vertices and with no vertex of degree five 
or more), while the graph specifies the exact isomer. It must be kept in mind, 
however, that a structural graph does not contain all the information con¬ 
tained in the three-dimensional model of the chemical compound. The 
structural graph does not specify the bond distances or the bond angles of 
the molecule. Since these are known only for a small number of organic 
molecules anyway, this is not much of a handicap. A slightly more serious 
shortcoming of a graph is its inability to distinguish between stereoisomers 
[15-40]. Thus, except for stereochemistry purposes, the structural graph gives 
a reasonably adequate description of a chemical compound. 

Canonical Representation of a Molecule 

As pointed out above, a standard representation of chemical structures is 
a precondition for a computerized information retrieval system. From the 
structural point of view, organic molecules can be divided into two classes: 
(1) the aliphatic compounds, and (2) the ring compounds. The structural 
graph of an aliphatic compound has no circuit, except possibly circuits of 
length two arising out of multiple bonds, which are represented by parallel 
edges, as shown in Fig. 15-12. The graph of a ring compound contains at 
least one circuit of length three or more. 

Since the graph of an aliphatic compound is a tree (if we ignore parallel 
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edges), it can easily be given a canonical representation as follows: Every 
tree has a unique centroid or a pair of centroids (parallel edges are considered 
as single edges for the purposes of locating the centroid). The centroid can be 
used as the root of the tree (recall Section 10-3), and each subtree attached to 
the root is a radical. The subtrees can be ordered by the number of vertices 
they contain in a nondecreasing order. Each radical is further subdivided into 
subradicals, which are ordered in the same fashion. This process produces a 
unique linear code for each tree—a string. For example, the code for the tree 
in Fig. 15-12 is C(C) ( = 0)(C(N)). For more details on coding of aliphatic 
compounds, see [15-42]. 

Cyclic compounds are less tractable, because no unique centroid (or pair 
of centroids) can be defined in a graph with circuits. Fortunately, the chemist 
need not be concerned with the general problem of coding a graph (a very 
difficult problem, as discussed in Chapter 11). The structural graph of 
almost every ring compound is (1) planar, (2) a regular graph of degree three, 
and (3) contains a Hamiltonian circuit. It is not very difficult to find a unique 
linear code for such a graph. There exists an «-sided polygon in such a graph 
of n vertices, and a description of the graph requires only some notation for 
the remaining n/2 edges. These edges may be represented by a sequence of n 
numbers consisting of their end vertices. For details on this coding scheme 
for most ring compounds, see [15-42]. 


Matching of Chemical Structure 

The problem of determining whether or not two chemical compounds 
(having the same chemical formula) are identical is the same as the graph 
isomorphism problem, considerably simplified by the labels of the vertices. 
Finding a unique code for a graph implies the solution of the isomorphism 
problem as well, because two graphs would be isomorphic if and only if their 
codes were the same. For chemical structures, however, it is generally easier 
to perform a direct vertex-by-vertex matching than to first find a unique code 
for each graph. We shall describe one such algorithm for matching of 
chemical structures based on Sussenguth’s paper [11-59]. 

The underlying idea behind this algorithm is to use various properties 
(such as labels, degrees, adjacencies, etc.) of vertices in the two graphs to 
generate pairs of vertex subsets, which must match if the graphs are to be 
isomorphic. An increasing number of properties are used to partition vertices 
into smaller and smaller subsets. Eventually, either every vertex in one graph 
is uniquely paired off with a vertex in the other graph, or two subsets of 
vertices characterized by identical properties in the two graphs do not have 
the same number of vertices. (A third case arises when more than one 
isomorphism exists between the two graphs.) The process can be best 
explained with an example: 
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Fig. 15-13 Structural graphs of two molecules. 




Let us determine whether or not the two molecules in Fig. 15-13 are 
identical (H atoms are not shown, as usual). The vertices are arbitrarily named 
(1), (2), . . ., (8) in G and (a), ( b ), . . . , (h) in J. 

The process of generating matching subsets with common properties is 
outlined in Table 15-1. For instance, vertices representing carbon atoms in 
G must correspond to those representing carbon atoms in J. Of these, the 
pendant C vertices in G must correspond to pendant C vertices in J, and so 
on. (Note that Table 15-1 shows only a part of the subsets that are actually 
generated and matched.) From Table 15-1, we conclude that G and J are 


Criteria for Corresponding Subsets in 

Subset Subset 

Formation G J Number 


Vertex label: C 

1,2, 4,7 

b, e,f, h 

1 

O 

3,6,8 

a, c,g 

2 

N 

5 

d 

3 

Degree: One 

1,3, 5, 6 

a, d, g, h 

4 

Two 

8 

c 

5 

Three 

2,4 

e,f 

6 

Four 

7 

b 

7 

Bond: Single 

1,2, 3, 4, 5, 6,7 

a, b,d,e,f,g,h 

8 

Double 

7,8 

b, c 

9 

Intersection of 

subsets 1 and 4 

1 

h 

10 

Vertices adjacent to 

1 lh 

2 

f 

11 

5 Id 

4 

e 

12 

lib 

4, 6,8 

a , c, e 

13 

Intersection of 

subsets 4 and 13 

6 

a 

14 

Remaining vertex 

3 

g 

15 


Table 15-1 Matching of Chemical Structures 






sec. 15-4 


GRAPHS IN CHEMISTRY 453 


isomorphic and the vertex correspondence is 

/I 2 3 4 5 6 7 8\ 

\h f g e d a b c) 

A similar procedure can be used to identify one given graph as a subgraph of 
another. 

Computerized Chemical Identification 

Given the chemical formula of a “new” substance and the valence rules, 
one can generate the list of all distinct chemical structures possible, using 
graph enumeration techniques. Computer programs have been written to 
perform this operation. (It is necessary, of course, to have a coding scheme 
that provides a unique representation for a structural graph.) This method of 
producing an exhaustive list of all possible isomers gets out of hand as the 
number of atoms in the molecule-increases. For example, there are over \ 
million structures possible for C 20 H 41 OH. It is therefore necessary to provide 
additional chemical information (such as the type of radicals ruled out as 
unstable) to keep the list to a manageable size. A computer program can be 
written to compare each of the structures in the list against various sets of 
analytical data, particularly mass spectra. 

As a part of the continuing effort toward a system of automated identifica¬ 
tion of chemical compounds, a computer language, called DENDRAL, has 
been developed at Stanford University. See [15-38] and [15-39]. One of the 
programs in DENDRAL generates the list of all tree-type potential isomers 
from an input of molecular formula and mass spectrum. The program, written 
in LISP language, consists of 40,000 words, and is run on a PDP-6 time¬ 
sharing system at Stanford. One of the long-term goals of such an effort is to 
develop a tool for automated chemical exploration of the planets [15-42]. 

Remarks and References 

Lederberg and Feigenbaum and their team at Stanford University have 
done the pioneering work in computerized chemical identification via graph 
theory. A number of technical reports and papers (four of them referenced in 
[15-44]) describe various aspects of the DENDRAL program. See also 
[15-37]. For a very readable description of the essentials of DENDRAL, see 
the paper by Feigenbaum and Lederberg [15-38]. Another paper by Lederberg 
[15-40] is recommended as a well-written exposition of how graphs can be 
used for representing structures of organic molecules—both tree type and 
ring type. 

Sussenguth in his doctoral thesis and in a subsequent paper [11-59] has 
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given an algorithm for matching chemical structures. He reports that the com¬ 
putation time in his algorithm varies only as the square of the number of 
vertices; and that his computer program when run on an IBM 7090 took 6 to 
7 seconds for matching 50-vertex graphs and only a few thousandths of a 
second to detect if the graphs were not matched. 

A survey of computer methods in handling chemical structures is available 
in [15-44], which includes most of the relevant references through 1966. 
Other papers recommended are [15-38], [15-43], and [11-59]. The search for a 
good coding system is far from over. Papers proposing alternative notational 
systems continue to appear in the Journal of Chemical Documentation. 

15-37. Buchanan, B. G., G. L. Sutherland, and E. A. Feigenbaum, “HEURISTIC 
DENDRAL: A Program for Generating Explanatory Hypotheses in Organic 
Chemistry,” Machine Intelligence , Vol. 4 (Meltzer and D. Michie, eds.), Edinburgh 
University Press, Edinburgh, 1969. 

15-38. Feigenbaum, E. A., and J. Lederberg, “Mechanization of Inductive Inference 
in Organic Chemistry,” in Formal Representation of Human Judgement (B. Klein- 
muntz, ed.), John Wiley & Sons, Inc., New York, 1968, 187-218. 

15-39. Gluck, D. J., “A Chemical Structure Storage and Search System Developed at 
Du Pont,” J. Chem. Doc., Vol. 5, No. 1, Feb. 1965, 43-51. 

15-40. Lederberg, J., “Topology of Molecules,” in The Mathematical Sciences, The 
M.I.T. Press, Cambridge, Mass., 1969, 37-51. 

15-41. Lynch, M. F., J. M. Harrison, W. G. Town, and J. E. Ash, Computer Handling 
of Chemical Structure Information, Elsevier Publishing Company, Amsterdam, 
1971. 

15-42. Meetham, A. R., “Partial Isomorphisms in Graphs and Structural Similarities 
in Tree-Line Organic Molecules,” Proc. IFIP Congress, 1968. 

15-43. Penny, R. H., “A Connectivity Code for Use in Describing Chemical Structures,” 
J. Chem. Doc., Vol. 5, No. 2, May 1965, 113-117. 

15-44. Tate, F. A., “Handling Chemical Compounds in Information Systems,” Ann. 
Rev. Inf, Sci. Tech. Vol. 2 (C. A. Cuadra, ed.), John Wiley & Sons, Inc., New 
York, 1967, 285-309. 


15-5. MISCELLANEOUS APPLICATIONS 

There is virtually no end to the list of problems that can be solved with 
graph theory. In addition to applications covered in the last four chapters, 
many applications were mentioned in earlier chapters, for example, binary 
search trees for file organization (Chapter 3), design of printed-circuit board 
(Chapter 5), dimers problem in crystal physics (Chapter 8), teleprinter’s 
problem (Chapter 9), and ranking problem (Chapter 9). The following are 
some additional examples of applications. 

Information Retrieval: In a modern information retrieval system each 
document carries a number of index terms (also called descriptors). The index 
terms are represented as vertices, and if two index terms v i and v } are closely 
related (such as “graph” and “tree”), they are joined with an edge (v„ Vj). 
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The simple, undirected (and possibly disconnected) large graph thus produced 
is called the similarity graph. Components (i.e., maximally connected sub¬ 
graphs) of this graph produce a very natural classification of documents in 
the system. For retrieval, one specifies some relevant index terms, and the 
maximal complete subgraph that includes the corresponding vertices will 
give the complete list of index terms which specify the needed documents. 
Establishing graph isomorphism is needed in a situation such as an informa¬ 
tion retrieval system for chemical compounds. The reader is referred to 
Salton [15-60] for more on the subject. Reference [11-1] is also recommended. 

Analysis of Lumped Physical Systems: In Chapter 13 we saw how a system 
consisting of two-terminal electrical components was represented (and 
analyzed) by means of a graph. In that case the graph looked very much like 
the network schematic diagram. This approach can be generalized so that a 
graph (called the system graph ) is used to model any physical system built 
from a finite number of interconnected components, given the model of each 
component, of course. The system graph is a convenient tool in analysis of 
the entire physical system. See Trent [15-61] or Koenig, Tokad, and Kesavan 
[15-57] for-more details. 

Matrix Inversion: For inverting a large (say, 100 by 100) sparse matrix M 
by a computer, the straightforward application of the Gaussian elimination 
method is inefficient, is susceptible to poor accuracy, and causes storage 
problems. The following graph-theoretic method has been found to be better: 

1. Replace each nonzero entry in the given matrix M with a 1, and permute 
the rows and the corresponding columns of the resulting binary matrix 
to make all diagonal entries as l’s. 

2. The matrix X so obtained is now regarded as the adjacency matrix of a 
digraph G (the self-loops corresponding to l’s along the diagonal are 
deleted). 

3. The resulting digraph is partitioned into its fragments. 

4. A fragment, if too large, is “torn” further into smaller fragments by 
removing an appropriate edge. 

5. The smaller matrices are inverted, and from them the inverse of the 
original matrix M~’ is obtained. 

For further details see Harary [15-54] and Iyer [15-56]. 

Graphs of Groups: Cayley showed that every group of order n can be 
represented by a strongly connected digraph of n vertices, in which each 
vertex corresponds to a group element and edges carry the label of a generator 
of the group (originally, Cayley used edges of different colors to show dif¬ 
ferent generators). Thus the graph of a cyclic group of order n is a directed 
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circuit of n vertices in which every edge has the same label. The digraph of a 
group uniquely defines the group by specifying how every product of elements 
corresponds to a directed edge sequence. This digraph, known as the Cayley 
diagram, is useful in visualizing and studying abstract groups. For more 
details on graphs of groups, see [15-52], 

Linguistics: Graphs have been used in linguistics to depict parsing dia¬ 
grams. The vertices represent words and word strings and the edges represent 
certain syntactical relationships between them. A set of words (vocabulary) 
and a set of rules (grammar) for forming strings (sentences) characterize a 
language. In other words, the language then is a set of all legal strings so 
generated. (Natural languages, because of their complexity, have defied 
attempts at such complete specifications.) One problem in computational 
linguistics is to identify whether or not a given string belongs to a language, 
whose vocabulary and grammar are given. For more on graphs in computa¬ 
tional linguistics, see [15-55]. 

Sociological Structures: Digraphs under the name sociograms have been 
used to represent relationships among individuals in a society (or group). 
Members are represented by vertices and the relationship (admiration, 
association, influence, etc.) by directed edges. Connectedness, separability, 
complete subdigraphs, size of fragments, and so forth, in a sociogram can be 
given immediate significance. A number of tribes have been studied by 
anthropologists and are classified according to their kinship structures. For 
more on this topic, see Flament [15-51], Harary [15-53], and Chapter 8 of 
Anderson [15-45]. 

Graph theory has also been used in economics [15-46], logistics, cyber¬ 
netics, artificial intelligence, pattern recognition, genetics, reliability theory, 
fault diagnosis in computers, studying the structure of computer memory, 
and the study of Martian canals [15-63]. A mathematical model of disarma¬ 
ment [1-2] has been attempted with graph theory and so have the conflict in 
the Middle East and the structure of Mozart’s opera, Cosi fan tutte [15-53]. 
And thus goes the ever-increasing list of applications of graph theory. Admit¬ 
tedly, in some of these applications a graph is used only as a means of visual 
representation, and no more than a trivial use is made of graph theory itself. 
There are many cases, however, where important and not-so-obvious results 
are obtained through a deeper use of graph theory. 

Bibliographies and Further Reading: Although throughout the text we have 
provided selected readings, the number of published papers on graph theory 
is much larger (over 3000). There are several good bibliographies available, 
where most of the published material on the subject is listed. The best known 
among these are Zykov [15-64], Turner and Kautz [15-63], Turner [15-62], and 
Deo [15-49]. An informative article containing a systematic list of definitions 
and a bibliography of graph theory as applied to physics is [15-50]. Some 
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additional books recommended are Mayeda [15-59], Chen [15-48], Marshall 
[15-58], and Behzad and Chartrand [15-47]. I hope your interest in graph 
theory has been aroused sufficiently so that you will go exploring in the cited 
literature on your own. 

15-45. Anderson, S., Graph Theory and Finite Combinatorics , Markham Publishing 
Company, Chicago, 1970. 

15-46. Avondo-Bodino, G., Economic Applications of the Theory of Graphs, Gordon and 
Breach, Science Publishers, Inc., New York, 1962. 

15-47. Behzad, M., and G. Chartrand, Introduction to the Theory of Graphs, Allyn 
and Bacon, Inc., Boston, 1972. 

15-48. Chen, W., Applied Graph Theory, North-Holland Publishing Company, Amster¬ 
dam, 1971. 

15-49. Deo, N., “An Extensive English Language Bibliography on Graph Theory and 
Its Applications,” National Aeronautics and Space Administration/JPL (California 
Institute of Technology) Technical Report 32-1413, October 1969; Supplement 1, 
April 1971. 

15-50. Essam, J. W., and M. E. Fisher, “Some Basic Definitions in Graph Theory,” 
Rev. Mod. Phys., Vol. 42, No. 2, April 1970, 272-288. 

15-51. Flament, C., Applications of Graph Theory to Group Structure, Prentice-Hall, Inc., 
Englewood Cliffs, N.J., 1963. 

15-52. Grossman, I., and W. Magnus, Groups and Their Graphs, The Random House/ 
Singer School Division, New York, 1964. 

15-53. Harary, F., “Graph Theory as a Structural Model in the Social Sciences,” in 
Graph Theory and Its Applications (B. Harris, ed.), Academic Press, Inc., New 
York, 1970, 1-16. 

15-54. Harary, F., “Sparse Matrices and Graph Theory,” in Large Sparse Sets of 
Linear Equations (J. K. Reid, ed.), Academic Press, Inc., New York, 1971, 139-150. 

15-55. Harris, Z., Mathematical Structure of Language, John Wiley & Sons, Inc. (Inter¬ 
science Division), New York, 1968. 

15-56. Iyer, C., “Computer Analysis of Large-Scale Systems,” Ph.D. Thesis, Department 
of Electrical Engineering, University of Hawaii, Honolulu, May 1971. 
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Systems, McGraw-Hill Book Company, New York, 1967. 

15-58. Marshall, C. W., Applied Graph Theory, John Wiley & Sons, Inc., (Interscience 
Division), New York, 1971. 

15-59. Mayeda, W., Graph Theory, John Wiley & Sons, Inc., (Interscience Division), 
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BINET-CAUCHY THEOREM 


The following classical result, known as the Binet-Cauchy theorem, is useful 
in calculating the determinant of the product of two matrices: If Q and R are k 
by m and m by k matrices, respectively, with k < m, then the determinant of the 
product 


det(QR) = the sum of the products of corresponding 
major determinants of Q and R. 

The term major determinant (or simply major) means the determinant of the 
largest square submatrix of Q (or R) formed by taking any k columns (or rows) 
out of m. The term corresponding implies that if columns i u i 2 ,... , i k of Q are 
chosen for a particular major, the corresponding major of Q must consist of rows 

ii,i 2> ..., i k of Q- Clearly, there are such product terms. 

Before proving the theorem, let us illustrate with an example: 

Let 


and 


det(QR) = det 


det 


Q = 


R = 


-3 

-1 

1 -1 

3 -2 


4 -3 -2" 

2 -1 0 . 

1 -1 
-2 0 
3 -2 

1 -1 
2 0 
-3 -2 

-1 0 


det 


+ det 


+ det 
det 


4 

2 


= 2-(-2) + 4-1 + (-2).4 = -8. 
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Proof: To evaluate det(QR), let us devise and multiply two (m + k ) by (m + k) 
partitioned matrices: 


u 

Q~ 


~ Q O' 


' 0 

QR~ 

_o 

l 

WJ 


_-l» R_ 


_-u 

R _ 


where l m and I* are identity matrices of order m and k, respectively. Therefore, 


That is, 


det 



0 

R 


det 



QR~ 

R 


det(QR) = det 



0 

R 


(A-l) 


Let us now apply Cauchy’s expansion method to the right-hand side of Eq. (A-l), 
and observe that the only nonzero minors of any order in matrix — \ m are its 
principal minors of that order. We thus find that the Cauchy expansion consists 
of these minors of order m — k multiplied by their cofactors of order k in Q and 
R together. ■ 
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NULLITY OF A MATRIX 
AND SYLVESTER’S LAW 


If Q is an n by n matrix, then Qx = 0 has a nontrivial solution x ^ 0 if and 
only if Q is singular; that is, det Q = 0. The set of all vectors x that satisfy Qx = 0 
forms a vector space called the null space of matrix Q. The rank of the null space 
is called the nullity of Q. Furthermore, it can be shown that 

rank of Q + nullity of Q = n. (B-l) 

These definitions and Eq. (B-l) also hold when Q is not square but a k by n matrix, 
k < n. 

An important result involving nullity of matrices is Sylvester’s law of nullity, 
which can be stated as follows: 

Sylvester's Law: If Q is a k by n matrix and R is an n by p matrix, then the 
nullity of the product cannot exceed the sum of the nullities of the factors; that is, 

nullity of QR < nullity of Q + nullity of R. (B-2) 

Proof: Since every vector x that satisfies Rx = 0 must certainly satisfy QRx = 0, 
we have 

nullity of QR > nullity of R 0. (B-3) 

Let s be the nullity of matrix R. Then there exists a set of s linearly independent 
vectors 


{Xj, X 2 , • • •, x,} 

forming a basis of the null space of R. Thus 

Rx, = 0, for / = 1, 2,. .., s. (B-4) 
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Now let s + t be the nullity of matrix QR. Then there must exist a set of t 
linearly independent vectors 

{x$+i, ^s+2> • * • > X s +t} 

such that the set 

{Xi, X2, • ■ . j X s , X I+1 , X s + 2> • • • > Xj+/} 

forms a basis for the null space of matrix QR. Thus 

QRx,- = 0, for i — 1, 2, . . ., s, s + 1, s + 2, «..*,, s + /. (B-5) 

In other words, of the s + / vectors x, forming a basis of the null space of QR, 
the first s vectors are sent to zero by matrix R and the remaining nonzero Rx,-’s 
(i = s + 1, s + 2,. . ., s + t) are sent to zero by matrix Q. Vectors 

^X s -|.j, ^X s + 2> • • • > f^X 

are linearly independent; for if 

0 = tfiRx J+ , + tf 2 Rx i+2 + • • • + tf,Rx J+f 
= R(a 1 x :S+1 + a 2 x s +2 + • • • + a c x s+l ), 

then vector (aix i+J + ^2X5+2 + • • • + a,x s+[ ) must be the null space of R, which 
is possible only if = a 2 = • • • = a, = 0. 

Thus we have found that there are at least t linearly independent vectors which 
are sent to zero by matrix Q, and therefore 

nullity of Q > t. 

But since 

t = (s + t) — 5 

= nullity of QR — nullity of R, 

Eq. (B-2) follows. 

Substituting Eq. (B-l) into Eq. (B-2), we find that 

rank of QR > rank of Q + rank of R — n. (B-6) 

Furthermore, in Eq. (B-6) if the matrix product QR is zero, then 


rank of Q + rank of R < n ■ 


(B-7) 
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Arborescence, 206 
number of, 223, 238 
root of, 206 

Articulation point (see Cut-vertex) 
Assignment problem, 178, 396 
Automata (see Sequential machines) 
Automatic flow charting, 448 
Automorphism, 267 


B 

Backtrack, 288, 301 
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definition, 257 
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Counting series, 243, 257 
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Critical path method (CPM), 400-408 
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Cut, 387 
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properties of, 69-71 
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Data structure in graph algorithms, 270-273 

Decanting problem, 13 

Decision tree, 41 
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Deficiency, 181 
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matrix, 139, 164 
of a permutation, 251 
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Depth-first search, 301-304 
Deterministic sequential machine, 342 
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acyclic, 301, 410 
adjacency matrix of, 220 
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complete, 197 
definition, 194 
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edge, 236 
Euler, 203 
game, 410 
irreflexive, 199 
kernel of, 411 
pseudosymmetric, 197 
reflexive, 199 
regular, 197 

representation of permutations, 251 
simple, 197 

strongly connected, 202 
symmetric, 197, 199 
transitive, 200 
weakly connected, 202 
weighted, 400 
Dihedral group, 266 
Dimension of vector space, 124 
Dimer problem, 185-186 
DIP,317 
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Directed circuits, 202, 212, 230, 232, 287, 
291,421,443 

Directed graph (see Digraph) 

Directed Hamiltonian circuits, 312 
Directed path, 231, 288, 403, 423 
Disconnected graph, 21, 139, 159, 161 
Distance matrix, 61, 273 (see also Weight 
matrix) 

Distance between two spanning trees, 59 

Distinct representatives, 179 

Division ring, 117 

Dominating set, 172-173 

Domination number, 173 

Dummy activity, 401 

Dual of graph, 103, 105, 190 


E 

Eccentricity of vertex, 46 
Edge, 1 
adjacent, 177 
backward, 389 
capacity of, 79, 385 
covering, 182-183 
current, 357 
directed, 195 
edge-current vector, 358 
edge digraph, 236, 408 
forward, 389 
gain, 418 

incident into a vertex, 195 

incident out of a vertex, 195 

initial vertex of, 195 

isomorphism, 87 

listing, 271 

parallel, 2 

pendant, 183 

sequences of, 160, 222, 333 
series, 9 

terminal vertex of, 195 
train, 20 (see also Walk) 
variables, 357 
voltage, 357 
weight of, 61 

Edge, connectivity, 75 
Edge covering, 182-183 
Edge current, 357 
Edge digraph, 236, 408 
Edge gain, 418 


Edge isomorphism, 87 

Edge listing, 271 

Edge train, 20 (see also Walk) 

Edge variables, 35 7 
Edge voltage, 357 
Edge-current vector, 358 
Edge-disjoint Hamiltonian circuits, 32 
Edge-disjoint subgraphs, 17 
Edge-disjoint union of circuits, 115, 212 
Edge-disjoint union of cut-sets, 71 
Edge-voltage vector, 358 
Electrical network: 

application of graph theory to, 356-383 
as flow problems, 399 
use of computers in, 268 
use of incidence matrices in, 271 
Elementary reduction, 99 
Elementary tree transformation (see Cyclic 
exchange) 

Embedding: 
of graph, 90 
on sphere, 94 
Enumeration: 
of digraphs, 263 
of multigraphs, 262 
of simple graphs, 260 
of trees, 52, 240-250 
Enumerator, 243 
Equivalence classes, 201 
Equivalence relation, 239 
Ergodic process, 429 
Error-checking code, 352, 354 
Euler graph, 23, 115, 210 
Euler lines: 

directed, 203, 210, 225, 227 
number of, 205, 226, 238 
in spanning arborescence, 210 
Euler’s formula, 96 
Events: 
critical, 403 
in projects, 400 

Event-vertex representation, 408 
Execution time (see Computation time) 
Exterior region, 94 


F 

Faces (see Regions) 
False vertices, 350 
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Fary’s theorem, 93 
Field, 117 

Figure counting series, 258 
Finite Fields, 119 

Finite-state machines (see Sequential 
machines) 

First Betti number (see Nullity) 

Five-color theorem, 188 
Float of activity, 406 
Flow, 385 
Flow chart, 269 
Flow network, 384, 398 
Flow problem, 384-399 
matching problem, as, 182, 396 

use of computers in, 268 
Forest, 55 

Fortran Extended Graph Theoretic 
Language (FGRAAL), 317 
Forward calculation, 405 
Four-color conjecture, 10, 187-190 
Fragments: 
definition, 202 
finding all, 312 
in program segmentation, 443 
Free trees: 
definition, 48 
number of unlabeled, 248 
Fronds, 303 

Full symmetric group, 253 
Function: 
definition, 256 
equivalence classes of, 256 
Fundamental circuits: 

algorithm, 280-284 
and cut-sets, 73 
definition, 57, 71 
for digraph, 212 
matrix, 144 

application in electrical networks, 359 
deriving of, 144 
for digraph, 219 
Fundamental cut-sets: 

definition, 71 
for digraph, 212 
matrix, 153 

relationship with other matrices, 153 
in synthesis of contact networks, 336- 
339 

Fusion of vertices, 28, 274 


G 

■Galois field: 
modulo m, 118-119 
modulo 2, 138 
Game, 409-413 
comparison with puzzle, 409 
digraph of, 410 
finite, 409 

perfect-information, 409 
states in, 410 
two-person, 409 
Generating functions, 241 
Geometric dual of a graph, 103 
Geometric representation of a graph, 89 
Graph Information Retrieval Language 
(GIRL), 317 

Graph Algorithm Software Package (GASP),' 
316 

Graph Extended Algol (GEA), 317 
Graphs: 

arbitrarily traceable, 29 
as data structures, 448 
bipartite, 168 
bichromatic, 166 
circuit-free, 55 
in coding theory, 351-353 
complement of, 56, 76 
complete, 32 
complete bipartite, 192 
connected, 21 
decomposition of, 26 
definition, 1 
directed (see Digraph) 
disconnected, 21 
drawing of, 2 
equivalence, 200 
Euler, 23, 28 
finite, 7 

in game theory, 409-413 
general, 2 
infinite, 7 

isomorphic, 14, 139 
Kuratowski, 90, 93 
labeled, 53 
linear, 1 
nonplanar, 90 
nonseparable, 151 
null, 9 

nullity of, 57, 60 
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operations on, 26 

oriented, 195 (see also Digraph) 

planar, 90 

rank of, 57 

regular, 8 

“rigid”, 209 

ring sum of, 26 

separable, 142 

self dual, 107 

signal-flow, 416-423 

similarity, 455 

simple, 2 

stochastic, 426 

subspaces of, 133 

transition, 426 

tree, 60 

two-connected, 83 
union of, 26 
unicursal, 24 
uniquely colorable, 172 
universal, 32 
unlabeled, 53 
number of, 239 
vertex, 9 

weighted, 34, 61-63 
Graph-theoretic algorithms, 269-316 
performance, 270 

Graph-theoretic languages, 316-317 
GRASPE, 317 
Gray codes, 351 
Group: 
abelian, 114 
definition, 113 
permutation, 250 
of subgraphs, 115 

Graph Theoretic Programming Language 

(GTPL), 316 

H 

Hamiltonian circuit, 30-34 
number of, 268 
origin of, 10 
Hamiltonian path, 30-34 
finding, in a graph, 312 
shortest, 63 (see also Traveling salesman 

problem) 

Hamming distance, 349 
Height of a tree, 50 

Heuristic procedure, 310 


HINT, 317 

Homeomorphic graphs, 100 
Huffman graph-theoretic codes, 352 


Identification of chemical compounds, 449 

Identity element, 113, 116 

Identity permutation, 252 

Immediate successors, 272 

Impedance matrix, 371 

Incidence, 7 

Incidence matrix, 137-140 
for digraph, 214 
in electrical networks, 359 
as input in algorithms, 271 
rank of, 140 
reduced, 141, 214 

relationship with other matrices, 161 
in synthesis of contact networks, 336 
In-degree, 195, 287, 400 
Independence number, 170-171 
Independent circuits, 144 
Independent set of vertices, 169-170 
Independent set of edges, 193 
Infinite graph, 7 
Infinite region, 94 
Information retrieval, 449, 454 
Instant Insanity, 18 
Intermediate vertices, 386 
Internal states, 342 
Internal vertices, 49 
Internally stable set, 169 
Intersection of graph, 26 
Intersection of subspaces, 131 
In-tree, 207 (see also Arborescence) 
In-valence {see In-degree) 

Invariant of a graph, 311 
Inward demidegree {see In-degree) 

Isograph {see Balanced digraph) 

Isolated vertex, 8 
Isomorphic graphs, 14, 139 
Isomorphic digraphs, 196 
Isomorphism, 14, 53, 159, 209, 239, 274, 

284, 310, 451 

J 

Join, 132 

Jordan curve theorem, 91, 189 
Jordan’s method of elimination, 337 
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K 

7C-chromatic graph, 166 
k -connected graph, 78 
Kernel, 411 
Kirchhoff matrix, 223 
Kirchhoffs current law, 358, 441 
Kirchhoff s voltage law, 359 
Konigsberg bridge problem, 3,23 
Kruskal’s algorithm, 62, 280 
Kuratowski graphs, 90, 93, 341 
Kuratowski’s theorem, 100 


L 

Labeled graph, 53 
Labeled trees, 240 
Latin square, 193 
Level of vertex, 49 
Line digraph, 236 
Line: 

Euler, 23 
unicursal, 24 
(.See also Edge) 

Linear combination, 123 
Linear complex, 3 {see also Graphs) 
Linear dependence, 123 
Linear programming, 216, 386 
Linearly independent, 123, 216 
Linguistics, graphs in, 456 
Link (see Chord) 

Longest-path analysis, 301 
Loop: 

definition, 1, 21 
in electrical networks, 360 
Loop impedance matrix, 371 
Lossy networks, 392 
Lower bound on edge capacity, 392 
Lumped physical systems, 455 


M 

Map coloring, 187 
Map-construction approach, 304 
Markov chain, 425 
Markov process, 424-439 
asymptotic behavior, 433 
definition, 424 
periodic, 431 


transient analysis, 437 
with transient states, 432 
Marriage problem, 180 
Mason’s gain formula, 421-422 
Matching, 177-182 
in bipartite graphs, 182 
definition, 177 
maximal, 178 
perfect, 186 
Matching number, 178 
Matching problem (see Assignment 
problem) 

Matrix: 

adjacency, 157-159 
circuit, 142-143 
cut-set, 151, 153 
incidence, 137, 139 
inversion, 455 
relation, 201 

representing a graph, 137 
stochastic, 425 
transition, 425 
transmission, 332 
weight, 61 

Max-flow min-cut theorem, 86, 387-388 
Maximal complete subgraph, 312 
Maximal flow, 312, 385-386 
Maximal matching, 178, 312 
Maximal planar graph, 111 
Maximal strongly connected subgraphs 
{see Fragments) 

Maxwell’s formula, 366 
Meshes {see Regions) 

Method of paired comparisons, 227 
Minimal cost flow, 393-395 
Minimal covering, 184, 328 
Minimal decyclization, 232, 313 
Minimal spanning tree, 61, 277-279 

Minimum-feedback arcs, 232 
Monoid, 113 

Multicommodity flow, 395-396 
Multiple sources and sinks, 390 

N 

Network analysis problem, 305 
Network: 
activity, 400-409 
contact, 329-341 
electrical, 5, 356-381 
in planning and scheduling, 400 
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synthesis of, 334 
transport, 384-389 
Network flows, 79 
Network functions, 370 
Nim, 410 

Nodal analysis, 362 
Node (see Vertex) 

Node admittance matrix, 363 
Node-removal method, 334 
Node voltages, 361, 370 
Nonplanar graph, 90, 306, 341 
Nonpolynomial algorithms, 315 
Nonseparable graph, 76, 284 
Null graph, 9, 122 
Nullity of a graph, 57, 60 
Number of different arborescences, 238 
Number of different directed Euler lines, 
238 

Number of free unlabeled trees, 248 
Number of labeled graphs, 239 
Number of labeled trees, 240 
Number of rooted labeled trees, 241 
Number of rooted unlabeled trees, 243 
Number of unlabeled graphs, 239 


O 

1-connected graph, 78 
1-factor (see Matching, perfect) 
1-isomorphic, 81 
One simplex, 3 (see also Edge) 
Operation on graphs, 26 
Operations research, graphs in, 384-414 
Optimal-policy matrix, 298 
Ordered trees, 209 
Orientation of graph, 195-196 
Orthogonal complements, 132 
Orthogonal vectors, 130 
Otter’s formula, 249 
Out-degree, 195, 287, 343,400 
Outer region (see Infinite region) 
Out-tree, 207 (see also Arborescence) 
Out-valence (see Out-degree) 

Outward demidegree (see Out-degree) 

P 

Pair group, 255 
Palm tree, 303 


Parallel edges, 2, 271, 401 
Parallel processing design, 448 
Parenthesis-free notation (see Polish 
notation) 
p-partite, 168 
Partitioning algorithm, 313 
Partitioning problem, 165 
Partitions, 243 
Passive edges, 363 
Path: 

critical, 403 

compared with walk and circuit, 21 
directed, 201 
Hamiltonian, 31 
length of, 20 
Path length, 51 
Path matrix, 156, 336 
Path product, 330 
Pafii-finding algorithm, 273 
Paton’s algorithm, 281 
Pendant vertex, 9, 43, 196 
Performance of graph-theoretic algorithms, 
314 

Permutation, 250-255 
degree of, 25 3 

Permutation group, 250, 253, 256 
Persistent state, 344 
PERT, 232, 268, 400-409 
Planar graph, 90, 108, 165 
Planarity testing algorithm, 99, 304-310 
Plane representation, 90, 95, 97, 273 
Planning and scheduling of networks, 
400-409 

Point (see Vertex) 

Polish notation, 208 

Polya’s counting theorem, 238, 250, 257- 
264 

Polynomial-bounded algorithms, 314 
Precedence matrix, 220 
Precedence relationship, 400 
Predecessor matrix, 220 
Preference graph, 227 

Primitive connection matrix, 332 
Prim’s algorithm, 62, 279 
Probability vector, 426 
Program: 

error detection in, 441 
optimization of, 448 
segmentation of, 443 
Program block, 440 
Program digraph, 440, 445 
Project cost curve, 406 
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Proper coloring, 165-168 
definition, 165 
of edges, 177 
of regions, 186 


Q 

Quadratic flow-cost function, 399 


R 

Radius of a tree, 48 
Random digraph, 296 
Random graph, 278, 321 
Random processes, 424 
Random walk, 427 
Randomly generated graph (see 
Random graph) 

Rank : 

of graph, 57, 60 
of incidence matrix, 214 
Ranking by Hamiltonian path, 228 
Ranking by score, 228 
Ranking with minimum violations, 229 
Reachability algorithm, 300 
Reachability matrix, 235 
Reachable vertex, 203 
Realizability: 
of a circuit matrix, 341 
of matrices, 162 

of a single-contact function, 335, 340 
Reduced incidence matrix, 153, 339 
Reference vertex, 214 
Reflected binary code, 351 
Regions, 93 
adjacent, 187 
coloring, 187 
Regular graph, 92 
Regular Markov process, 430 
Regularization of planar graph, 189 
Relation, 198-201 
digraph of, 220 
equivalence, 200 
matrix of, 201 
reflexive, 199 
symmetric, 199 
transitive, 200 
Relay contact, 329 
Ring, 117 


Ring sum, 26 
of circuits, 115, 212 
of cut-sets, 72 
RLC network, 362 
Rooted tree, 48, 241, 243 
number of unlabeled, 243 
Running time, 439, 441 


S 

s-field (see Skew field) 

Scaffolding (see Spanning tree) 

Search techniques, 271 

Seating problem, 6, 32 

Second-shortest path, 301 

Self-dual graphs, 107 

Self-loop, 1, 195, 271 

Semicircuits, 202, 212 

Semigroup, 113 

Semipath, 201 

Semiwalk, 201 

Separable graph, 76, 284 

Sequential circuits, 342 

Sequential machines, 165, 194, 342, 344 

Series edges, 99 

Set: 

of basic circuits, 107 
definition, 112 
empty, 112 
null, 112 

with one operation, 112-116 
with two operations, 116-119 
Shift register, 205 

Shortest-distance arborescence, 294 
Shortest-distance tree, 294 
Shortest-path algorithms, 290 
Signal-flow graph, 194, 416-423, 436 
Signal transmission network, 418 
Single-contact network, 335 
Sink, 385 

Skeleton of graph (see Spanning tree) 
Skew field, 117 
Slack: 
free, 406 
total, 406 

Snake-in-the-box code, 352 

Sociograms, 456 

Source, 385 

Source vertices, 418 

Spanning arborescence, 209, 303 

Spanning forest, 55, 146, 277 
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Spanning tree: 
algorithm for, 277-279 
all, in a graph, 55, 58, 238, 277, 280, 376 
application to electrical networks, 356, 
359 

computer running time, use in estimating, 
442 

definition, 55, 73, 209, 277 
degree-constrained shortest, 63 
minimal, 61, 279 
number of, 218 
root of, 281 

shortest (see Minimal spanning tree) 
sign of, 218, 376 
weight of, 61 

Spanning tree matrix, 164 
Sparse graph, 300 
Sparse matrix, 271 
Star graph, 184 
Start vertex, 441 
Starting state, 344 
State: 

absorbing (see Closed state) 

closed, 429 

ergodic, 429 

persistent, 344 

transient, 432 

trapping (see Closed state) 

State assignment problem, 346 
State diagram (see State graph) 

State equivalence, 345 
State graph: 
definition, 342 
properties, 343 
reduction of, 347 
State table, 342 
Static flow, 385 
Stationary process, 425 
Steady-state probabilities, 434 
Steiner tree, 313 
Stereographic projection, 95 
Stochastic graph, 426 
Stochastic matrix, 425 
Stochastic program-digraph, 445 
Stochastic system, 425 
Stochastically independent transition 
probabilities, 427 
Stop vertex, 441 

Storage requirement of program, 439 
Strongly connected, 202, 203, 222, 312 
Structural isomers, 53 


Subgraph, 16,21, 141,273 
Submatrix, 141 
Subnetworks, 408 
Subsequence, largest monotonically 
increasing, 44 

Subset, 112 
Subspace, 125 
Successor listing, 272 
Supersink, 390 
Supersource, 390 
Switching function, 184, 329 
Switching network, 146, 271, 328 
Sylvester’s law, 146, 152, 460 
System graph, 380, 455 


T 

Teleprinter’s problem, 204-205 
Terminal vertex of path, 20 
Thickness, 109 
Three-terminal devices, 373 
Through variable, 357 
Tie (see Chord-set) 

Time-invariant process, 425 
Topological order, 402, 443 
Topological sorting, 231, 313, 402 
Tournament, 197, 227-230 
Transient vertices, 446 
Transition function, 342 
Transition matrix, 220, 425, 444 
Transition probabilities, 425, 427 
multistep, 427 

Transitive closure of digraph, 300 
Transitivity (see Relation) 

Transmission, 331 
Transmission matrix, 332 
Transport network, 384-389 
Transportation problem, 393 
Trapping state (see Closed state) 

Traveling salesman problem, 34, 280, 313 
Tree, 39-54 
binary, 48 
centers in a, 45 
central, 60 
decision, 41 
diameter of, 48 
in digraphs, 206-211 
external path length of, 51 
family, 41 
free, 48 
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height of, 50 
labeled, 54 
null, 39 

number of, 238 
ordered, 209 
path length of, 51 
radius of, 48 
rooted, 48 

sorting (see Decision tree) 
shortest-distance, 294 
spanning, 55 
unlabeled, 54 

Tree admittance product, 366 
Tree graph, 60 
Tree pairs, 379 
Tree-felling procedure, 280 
True vertices, 350 

Tutte’s map-construction method, 108 
Two-connected graphs, 83 
Two-chromatic graph, 166-167 
Two-isomorphic graphs, 104, 143, 336 
Two-person games, 409 
Two-terminal contact network, 334 
Two-tree, 368 


U 

Unicursal graph, 24 
Unicursal line, 24 
Unimodular matrix, 214, 380 
Union of graphs, 26 
Unique code for graph, 451 
Unique embedding, 98 
Uniquely colorable graphs, 172 
Uniqueness of dual graphs, 103 
Universal graph, 32 
Unlabeled graphs, number of, 239 
Utilities problem, 4, 88 

V 

Valency (see Degree) 

Vector: 

definition, 120 
orthogonal, 130 

Vector space, 120-121 
application in analysis of networks, 359 
of graph, 121-122 


Vertex, 1 
closing, 410 
degree of, 7 
eccentricity of, 46 
end, 9 
even, 22 
forbidden, 288 
Vertex, fusion of, 28 
intermediate, 20 
internal, 49 
isolated, 8 
label of, 15 
level of, 49 

merged (see Vertex, fusion of) 
odd,22 

pendant, 9, 43, 196 
reference, 141 
starting, 410 

Vertex coloring, 165-169, 187 
Vertex connectivity, 75, 78 
Vertex cover, 193 

Vertex graph, 9 (see also Null graph) 
Vertex-disjoint subgraphs, 17 
Vertex-edge incidence matrix (see 
Incidence matrix) 
Vertex-labeling process, 390 
Violation in ranking, 229 
Vulnerability, 77, 284 

.W 

Walk, 19-21 
closed, 20 

compared with path and circuit, 21 
different types of, 35 
directed, 201 
open, 20 
Weight: 
of edge, 61 
of spanning tree, 61 
of subtree, 248 
of vertex, 248 

Weight matrix, 61, 273, 418 
Weighted graph, 61-63 
complete, 34 

Whitney’s theorem, 98, 106 
Windows (see Regions) 
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